USE OF NEURONAL APOPTOSIS INHIBITOR PROTEIN (NAIP)
Field of the Invention

This invention relates in general to the function of the NAIP inhibitor protein in apoptosis 
and more particularly to the use of NAIP antibodies, proteins, and nucleic acids to characterize
NAIP, identify compounds which modulate NAIP, and diagnose and treat conditions affected by
changes in NAIP levels. 

Background of the Invention

Apoptosis is a morphologically distinct form of programmed cell death that is important 
in the normal development and maintenance of multicellular organisms. Dysregulation of 
apoptosis can take the form of inappropriate suppression of cell death, as occurs in the 
development of some cancers, or in a failure to control the extent of cell death, as is believed to 
occur in acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). 

Childhood spinal muscular atrophies are neurodegenerative disorders characterized by 
progressive spinal cord motor neuron depletion and are among the most common autosomal 
recessive disorders (Dubowitz, V. 1978, Brooke, MA. 1986). Type I SMA is the most frequent 
inherited cause of death in infancy. The loss of motor neurons in SMA has led to suggestions
that an inappropriate continuation or reactivation of normally occurring motor neuron apoptosis
may underlie the disorder (Sarnat, H.B. 1992). NAIP, a gene associated with SMA, has been
mapped to human chromosome 5q13.1.

Some baculoviruses encode proteins that are termed inhibitors of apoptosis proteins 
(IAPs) because they inhibit the apoptosis that would otherwise occur when insect cells are 
infected by the virus. These proteins are thought to work in a manner that is independent of 
other viral proteins. The baculovirus IAP genes include sequences encoding a ring zinc
finger-like motif (RZF), which may be involved in DNA binding, and two N-terminal domains that
consist of a 70 amino acid repeat motif termed a BIR domain (Baculovirus IAP Repeat).

Summary of the Invention

We have discovered uses for NAIP proteins, nucleic acids, and antibodies for the 
detection and treatment of conditions involving apoptosis. Furthermore, we have discovered a 
novel NAIP sequence and a NAIP fragment with enhanced anti-apoptotic activities.

In general, the invention features a substantially pure nucleic acid molecule, such as a 
genomic, cDNA, antisense DNA, RNA, or a synthetic nucleic acid molecule, that encodes or
corresponds to a mammalian NAIP polypeptide. This nucleic acid may be incorporated into a 
vector. Such a vector may be in a cell, such as a mammalian, yeast, nematode, or bacterial cell. 
The nucleic acid may also be incorporated into a transgenic animal or embryo thereof. In 
preferred embodiments, the nucleic acid molecule is a human NAIP nucleic acid. In most 
preferred embodiments the NAIP gene is a human NAIP gene. In other various preferred 
embodiments, the cell is a transformed cell. 

According to one preferred embodiment, the nucleic acid sequence includes the cDNA 
sequences encoding exons 14a and 17. In a more preferred embodiment the sequence includes 
exons 1-14, 14a, and 15-17. In the most preferred embodiments the sequence also includes the
complete 5' and 3' untranslated regions of the NAIP gene and is represented as Seq. ID No. 2, 21,
or 23, most preferably, as in Seq. ID No. 21. In other preferred embodiments, the nucleic acid is
a purified nucleotide sequence comprising genomic DNA, cDNA, mRNA, anti-sense DNA or
other DNA substantially identical to the cDNA sequences of Seq. ID No. 2, 21, or 23
corresponding to the cDNA sequences of the invention. Most preferably exons 1 to 14 and 14a 
to 17 are as described in Seq. ID No. 21. 

In specific embodiments, the invention features nucleic acid sequences substantially 
identical to the sequences shown in Fig. 21, or fragments thereof. In another aspect, the 
invention also features RNA which is encoded by the DNA described herein. Preferably, the 
RNA is mRNA. In another embodiment the RNA is antisense RNA that is complementary to the
coding strand of NAIP. 

In a second aspect of the invention, the NAIP encoding nucleic acid comprises at least the 
3 BIR domains of a NAIP sequence provided herein (e.g., nucleotides 1-1360 of the NAIP 
sequence provided in Fig. 6), but lacks at least some of the sequences encoding the carboxy 
terminus of the NAIP polypeptide. Preferably, at least 30 nucleotides are deleted from the
region of the NAIP gene between nucleotides 1360 (i.e., the end of the BIR domains) and 4607
(i.e., the end of the coding sequence) of the NAIP sequence shown in Fig. 6, Seq. ID No. 21.
More preferably, at least 100 nucleotides are deleted, and even more preferably at least 1000
nucleotides are deleted. In the most preferred embodiment, up to 3247 nucleotides are deleted.
Preferably, the deletion results in a statistically significant increase in the anti-apoptotic activity 
of the encoded protein on one of the assays provided herein. 
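
The deletion bounds in this paragraph follow from the two stated coordinates (4607 - 1360 = 3247). The following is a minimal illustrative sketch, not part of the disclosed methods: the function names are hypothetical, positions are assumed to be 1-based, and the helper assumes the deletion is taken from the 3' end of the coding region, which the text does not require.

```python
# Coordinates taken from the text; 1-based nucleotide positions in Seq. ID No. 21.
BIR_END = 1360      # end of the BIR domains
CODING_END = 4607   # end of the coding sequence


def max_deletion(bir_end=BIR_END, coding_end=CODING_END):
    """Largest deletion that still leaves the BIR-encoding region intact."""
    return coding_end - bir_end


def preference_tier(n_deleted):
    """Map a deletion size to the preference tiers listed in the text."""
    if not 0 < n_deleted <= max_deletion():
        raise ValueError("deletion must lie between the BIR domains and the coding end")
    if n_deleted >= 1000:
        return "even more preferred"
    if n_deleted >= 100:
        return "more preferred"
    if n_deleted >= 30:
        return "preferred"
    return "below the preferred minimum"


def delete_from_coding_end(seq, n_deleted, bir_end=BIR_END, coding_end=CODING_END):
    """Hypothetical helper: drop the last n_deleted nucleotides of the coding region.

    Assumption: the deletion is placed at the 3' end of the coding region; the
    text only requires that it fall between bir_end and coding_end.
    """
    if not 0 < n_deleted <= coding_end - bir_end:
        raise ValueError("deletion would remove part of the BIR-encoding region")
    return seq[:coding_end - n_deleted] + seq[coding_end:]
```

Whether a given truncation actually increases anti-apoptotic activity still has to be established in one of the assays described herein; the sketch only checks coordinates.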

In a third aspect, the invention features a substantially pure DNA which includes a 
promoter capable of expressing or activating the expression of the NAIP gene or fragments
thereof in a cell susceptible to apoptosis. In preferred embodiments of this aspect, the NAIP
gene is human NAIP or fragments thereof, as described above. In further preferred embodiments
of this aspect of the invention, the promoter is the promoter native to the NAIP gene.
Additionally, transcriptional and translational regulatory regions are, preferably, those native to a
NAIP gene.

In another aspect, the invention provides transgenic cell lines, including the NAIP nucleic 
acids of the invention. The transgenic cells of the invention are preferably cells that are altered 
in their apoptotic response. In preferred embodiments, the transgenic mammalian cell is a 
fibroblast, neuronal cell, a pulmonary cell, a renal cell, a lymphocyte cell, a glial cell, a 
myocardial cell, an embryonic stem cell, or an insect cell. Most preferably, the neuron is a motor 
neuron and the lymphocyte is a CD4+ T cell.

In another related aspect, the invention features a method of altering the level of 
apoptosis that involves producing a transgenic cell having a transgene encoding a NAIP 
polypeptide or antisense nucleic acid. The transgene is integrated into the genome of the cell in a 
way that allows for expression. Furthermore, the level of expression in the cell is sufficient to 
alter the level of apoptosis. In preferred embodiments the transgene is in a motor neuron or a 
myocardial cell. 

In yet another related aspect, the invention features a transgenic animal, preferably a 
mammal, more preferably a rodent, and most preferably a mouse, having a NAIP gene as 
described above inserted into the genome (mutant or wild-type), or a knockout of a NAIP gene in 
the genome, or both. A transgenic animal expressing NAIP antisense nucleic acid is also
included. The transgenic animals may express either an increased or a decreased amount of 
NAIP polypeptide, depending on the construct used and the nature of the genomic alteration. For 
example, utilizing a nucleic acid molecule that encodes all or part of a NAIP to engineer a 
knockout mutation in a NAIP gene would generate an animal with decreased expression of either 
all or part of the corresponding NAIP polypeptide. In contrast, inserting exogenous copies of all 
or part of a NAIP gene into the genome, preferably under the control of active regulatory and 
promoter elements, would lead to increased expression of the corresponding NAIP polypeptide.

In another aspect, the invention features a method of detecting a NAIP gene in a cell by
contacting the NAIP gene, or a portion thereof (which is greater than 9 nucleotides, and preferably
greater than 18 nucleotides in length), with a preparation of genomic DNA from the cell. The
NAIP gene and the genomic DNA are brought into contact under conditions that allow for
hybridization (and therefore, detection) of nucleic acid sequences in the cell that are at least 50%
identical to the DNA encoding the NAIP polypeptides. Preferably, the nucleic acid used
comprises at least a part of exon 14a or exon 17, as provided in Figs. 6 and 7.

In another aspect, the invention features a method of producing a NAIP polypeptide in 
vivo or in vitro. In one embodiment, this method involves providing a cell with nucleic acid 
encoding all or part of a NAIP polypeptide (which is positioned for expression in the cell), 
culturing the cell under conditions that allow for expression of the nucleic acid, and isolating the 
NAIP polypeptide. In preferred embodiments, the NAIP polypeptide is expressed by DNA that 
is under the control of a constitutive or inducible promoter. As described herein, the promoter
may be a native or heterologous promoter. In preferred embodiments the nucleic acid comprises
exon 14a or exon 17. Most preferably the nucleic acid is the nucleic acid shown in either Fig. 6
or Fig. 7. Most preferably, it is the sequence of Fig. 6.

In another aspect, the invention features substantially pure mammalian NAIP 
polypeptide. Preferably, the polypeptide includes an amino acid sequence that is substantially 
identical to one of the amino acid sequences shown in any one of Figs. 6 or 7. Most preferably, 
the polypeptide is the human NAIP polypeptide of Fig. 6. Fragments including at least two BIR
domains, as provided herein, are also a part of the invention. Preferably, the fragment has at least
three BIR domains. For example, polypeptides encoded by the nucleic acids described above
having deletions between nucleotide 1360 and the end of the gene are a part of the invention.
In one embodiment, the NAIP fragments include those NAIP fragments comprising at least 15
sequential amino acids of Seq. ID No. 22 or 24. Most preferably the fragment includes at least a
portion of exon 14a or exon 17. 

In another aspect, the invention features a recombinant mammalian polypeptide derived 
from NAIP that is capable of modulating apoptosis. The polypeptide may include at least two 
BIR domains as defined herein, preferably three BIR domains. In preferred embodiments, the 
NAIP amino acid sequence differs from the NAIP sequences of Figs. 6 or 7 by only conservative 
substitutions or differs from the sequences encoded by the nucleic acids of Seq. ID Nos. 1, 2, 21
or 23 by deletions of amino acids carboxy terminal to the BIR domains. In other preferred 
embodiments the recombinant protein decreases apoptosis relative to a control by at least 5%, 
more preferably by 25%. 

In another aspect, the invention features a method of inhibiting apoptosis in a mammal 
wherein the method includes: providing nucleic acid encoding a NAIP polypeptide to a cell that 
is susceptible to apoptosis; wherein the nucleic acid is positioned for expression in the cell; the NAIP
gene is under the control of regulatory sequences suitable for controlled expression of the 
gene(s); and the NAIP transgene is expressed at a level sufficient to inhibit apoptosis relative to a 
cell lacking the NAIP transgene. The nucleic acid may encode all or part of a NAIP polypeptide. 
It may, for example, encode two or three BIR domains, but have a deletion of the carboxy- 
terminal amino acids. Preferably, the nucleic acid comprises sequences encoding exon 14a, exon 
17, or both. 

In a related aspect, the invention features a method of inhibiting apoptosis by producing a 
cell that has integrated, into its genome, a transgene that includes the NAIP gene, or a fragment
thereof. The NAIP gene may be placed under the control of a promoter providing constitutive 
expression of the NAIP gene. Alternatively, the NAIP transgene may be placed under the control 
of a promoter that allows expression of the gene to be regulated by environmental stimuli. For 
example, the NAIP gene may be expressed using a tissue-specific or cell type-specific promoter, 
or by a promoter that is activated by the introduction of an external signal or agent, such as a 
chemical signal or agent. In preferred embodiments the mammalian cell is a lymphocyte, a
neuronal cell, a glial cell, or a fibroblast. In other embodiments, the cell is in an HIV-infected
human, or in a mammal suffering from a neurodegenerative disease, an ischemic injury, a
toxin-induced liver disease, or a myelodysplastic syndrome.

In a related aspect, the invention provides a method of inhibiting apoptosis in a mammal
by providing an apoptosis-inhibiting amount of NAIP polypeptide. The NAIP polypeptide may
be a full-length polypeptide, or it may be one of the fragments described herein. 

In another aspect, the invention features a purified antibody that binds specifically to a 
NAIP protein. Such an antibody may be used in any standard immunodetection method for the 
detection, quantification, and purification of a NAIP polypeptide. Preferably, the antibody binds 
specifically to NAIP. The antibody may be a monoclonal or a polyclonal antibody and may be 
modified for diagnostic or for therapeutic purposes. The most preferable antibody binds the 
NAIP polypeptide sequences of Seq. ID Nos. 22 and/or 24, but not the NAIP polypeptide 
sequence disclosed in PCT/CA95/00581. 

The antibodies of the invention may be prepared by a variety of methods. For example, 
the NAIP polypeptide, or antigenic fragments thereof, can be administered to an animal in order 
to induce the production of polyclonal antibodies. Alternatively, antibodies used as described
herein may be monoclonal antibodies, which are prepared using hybridoma technology (see, e.g.,
Kohler et al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:511, 1976; Kohler et al.,
Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T Cell
Hybridomas, Elsevier, NY, 1981). The invention features antibodies that specifically bind
human or murine NAIP polypeptides, or fragments thereof. In particular, the invention features
"neutralizing" antibodies. By "neutralizing" antibodies is meant antibodies that interfere with
any of the biological activities of the NAIP polypeptide, particularly the ability of NAIP to
inhibit apoptosis. The neutralizing antibody may reduce the ability of NAIP polypeptides to
inhibit apoptosis by preferably 50%, more preferably by 70%, and most preferably by 90% or
more. Any standard assay of apoptosis, including those described herein, may be used to assess 
potentially neutralizing antibodies. 

In addition to intact monoclonal and polyclonal anti-NAIP antibodies, the invention
features various genetically engineered antibodies, humanized antibodies, and antibody
fragments, including F(ab')2, Fab', Fab, Fv, and sFv fragments. Antibodies can be humanized by
methods known in the art, e.g., monoclonal antibodies with a desired binding specificity can be
commercially humanized (Scotgene, Scotland; Oxford Molecular, Palo Alto, CA). Fully human
antibodies, such as those expressed in transgenic animals, are also features of the invention
(Green et al., Nature Genetics 7:13-21, 1994).

Ladner (U.S. Patent 4,946,778 and 4,704,692) describes methods for preparing single 
polypeptide chain antibodies. Ward et al. (Nature 341:544-546, 1989) describe the preparation
of heavy chain variable domains, which they term "single domain antibodies," which have high
antigen-binding affinities. McCafferty et al. (Nature 348:552-554, 1990) show that complete
antibody V domains can be displayed on the surface of fd bacteriophage, that the phage bind
specifically to antigen, and that rare phage (one in a million) can be isolated after affinity
chromatography. Boss et al. (U.S. Patent 4,816,397) describe various methods for producing
immunoglobulins, and immunologically functional fragments thereof, which include at least the
variable domains of the heavy and light chain in a single host cell. Cabilly et al. (U.S. Patent 
4,816,567) describe methods for preparing chimeric antibodies. 

In another aspect, the invention features a method of identifying a compound that 
modulates apoptosis. The method includes providing a cell expressing or capable of expressing a 
NAIP polypeptide, contacting the cell with a candidate compound, and monitoring the
expression of the NAIP gene or a reporter gene linked to the NAIP gene regulatory sequences, or
by monitoring NAIP biological activity. An alteration in the level of expression of the NAIP
gene indicates the presence of a compound which modulates apoptosis. The compound may be 
an inhibitor or an enhancer of apoptosis. In various preferred embodiments, the mammalian cell 
is a myocardial cell, a fibroblast, a neuronal cell, a glial cell, a lymphocyte (T cell or B cell), or 
an insect cell. 

In a related aspect, the invention features methods of detecting compounds that modulate 
apoptosis using the interaction trap technology and NAIP polypeptides, or fragments thereof, as a 
component of the bait. In preferred embodiments, the compound being tested as a modulator of
apoptosis is also a polypeptide. 

In a related aspect, the invention features a method for analyzing the anti-apoptotic effect
of a candidate NAIP, comprising: i) providing an expression vector for the expression
of the candidate NAIP; ii) transfecting mammalian cells with said expression vector; iii) inducing
the transformed cells to undergo apoptosis; and iv) comparing the survival rate of the cells with
appropriate mammalian cell controls.

In yet another aspect, the invention features a method for detecting the expression of 
NAIP in tissues comprising, i) providing a tissue or cellular sample; ii) incubating said sample 
with an anti-NAIP polyclonal or monoclonal antibody; and iii) visualizing the distribution of 
NAIP. 

In another aspect, the invention features a method for diagnosing a cell proliferation
disease, or an increased likelihood of such a disease, using a NAIP nucleic acid probe or NAIP 
antibody. Preferably, the disease is a cancer of the central nervous system. Most preferably, the 
disease is selected from the group consisting of neuroblastoma, meningioma, glioblastoma,
astrocytoma, neuroastrocytoma, promyelocytic leukemia, a HeLa-type carcinoma, chronic
myelogenous leukemia (preferably using xiap or hiap-2 related probes), lymphoblastic leukemia 
(preferably using a xiap related probe), Burkitt's lymphoma, colorectal adenocarcinoma, lung 
carcinoma, and melanoma. Preferably, a diagnosis is indicated by a 2-fold increase in expression 
or activity, more preferably, at least a 10-fold increase in expression or activity.

In another aspect, the invention includes a method of treating a patient having deleterious 
levels of apoptosis. Where the patient has more apoptosis than desirable or is otherwise deficient in
normal NAIP, the method includes the step of administering to said patient a therapeutically
effective amount of NAIP protein, NAIP nucleic acid, or a compound which enhances NAIP
activity levels in a form which allows delivery to the cells which are undergoing more apoptosis 
than is therapeutically desirable. In one preferred embodiment, the cell having deleterious levels 
of apoptosis is a myocardial cell in a patient diagnosed with a cardiac condition. 

Where insufficient levels of apoptosis are likely to occur, antisense NAIP nucleic acid,
NAIP antibody, or a compound which otherwise decreases NAIP activity levels may be 
administered. Treatment of SMA is specifically excluded from the invention. Thus, apoptosis
may be induced in a cell by administering to the cell a negative regulator of the NAIP-dependent
anti-apoptotic pathway. The negative regulator may be, but is not limited to, a NAIP polypeptide
fragment or purified NAIP specific antibody. For example, the antibody may bind to an epitope
in any one of the three BIR domains. The negative regulator may also be a NAIP antisense RNA
molecule. 

Skilled artisans will recognize that a mammalian NAIP, or a fragment thereof (as
described herein), may serve as an active ingredient in a therapeutic composition. This 
composition, depending on the NAIP or fragment included, may be used to modulate apoptosis 
and thereby treat any condition that is caused by a disturbance in apoptosis. Thus, it will be 
understood that another aspect of the invention described herein includes the compounds of the
invention in a pharmaceutically acceptable carrier.

As summarized above, a NAIP nucleic acid, polypeptide, or antibody may be used to 
modulate apoptosis. Furthermore, a NAIP nucleic acid, polypeptide, or antibody may be used in 
the discovery and/or manufacture of a medicament for the modulation of apoptosis. 

By "NAIP gene" is meant a gene encoding a polypeptide having at least exon 14a or exon
17 of Figs. 6 or 7, or the sequence of Fig. 5, Seq. ID No. 1, wherein at least 10 carboxy-terminal
nucleic acids have been deleted to enhance activity, as described above. In preferred
embodiments the NAIP gene encodes a polypeptide which is capable of inhibiting apoptosis or
eliciting antibodies which specifically bind NAIP. In preferred embodiments the NAIP gene is a
gene having about 50% or greater nucleotide sequence identity to the NAIP amino acid encoding
sequences of Figs. 6 or 7. In another preferred embodiment, the NAIP gene encodes a fragment
sufficient to inhibit apoptosis. Preferably, the region of sequence over which identity is
measured is a region encoding exon 14a or exon 17. Mammalian NAIP genes include nucleotide
sequences isolated from any mammalian source. Preferably, the mammal is a human. 

The term "NAIP gene" is meant to encompass any NAIP gene, which is characterized by 
its ability to modulate apoptosis and encodes a polypeptide that has at least 20%, preferably at 
least 30%, and most preferably at least 50% amino acid sequence identity with the NAIP
polypeptides shown in Figs. 6 and 7. Specifically excluded is the full length sequence disclosed
in PCT/CA95/00581 and shown in Seq. ID No. 1.

By "NAIP protein" or "NAIP polypeptide" is meant a polypeptide, or fragment thereof,
encoded by a NAIP gene as described above.

By "modulating apoptosis" or "altering apoptosis" is meant increasing or decreasing the 
number of cells that would otherwise undergo apoptosis in a given cell population. Preferably, 
the cell population is selected from a group including T cells, neuronal cells, fibroblasts,
myocardial cells, or any other cell line known to undergo apoptosis in a laboratory setting (e.g., 
the baculovirus infected insect cells). It will be appreciated that the degree of modulation 
provided by a NAIP or a modulating compound in a given assay will vary, but that one skilled in
the art can determine the statistically significant change in the level of apoptosis which identifies 
a NAIP or a compound which modulates a NAIP. 

By "inhibiting apoptosis" is meant any decrease in the number of cells which undergo 
apoptosis relative to an untreated control. Preferably, the decrease is at least 25%, more
preferably the decrease is 50%, and most preferably the decrease is at least one-fold. 
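
The percentage thresholds in this definition can be computed directly from cell counts. The following is a minimal sketch, assuming apoptotic cells are counted in matched treated and untreated populations of equal size; the function names and the default threshold argument are illustrative and not part of the disclosure.

```python
def percent_decrease(control_apoptotic, treated_apoptotic):
    """Percent decrease in apoptotic cells relative to the untreated control."""
    if control_apoptotic <= 0:
        raise ValueError("control must contain at least one apoptotic cell")
    return 100.0 * (control_apoptotic - treated_apoptotic) / control_apoptotic


def inhibits_apoptosis(control_apoptotic, treated_apoptotic, min_percent=25.0):
    """True if the decrease meets the chosen threshold (25% by default)."""
    return percent_decrease(control_apoptotic, treated_apoptotic) >= min_percent
```

For example, 150 apoptotic cells in the treated population against 200 in the control is a 25% decrease. Statistical significance across replicates would have to be assessed separately.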

By "polypeptide" is meant any chain of more than two amino acids, regardless of
post-translational modification such as glycosylation or phosphorylation.

By "substantially identical" is meant a polypeptide or nucleic acid exhibiting at least 
50%, preferably 85%, more preferably 90%, and most preferably 95% homology to a reference 
amino acid or nucleic acid sequence. For polypeptides, the length of comparison sequences will 
generally be at least 16 amino acids, preferably at least 20 amino acids, more preferably at least 
25 amino acids, and most preferably 35 amino acids. For nucleic acids, the length of comparison 
sequences will generally be at least 50 nucleotides, preferably at least 60 nucleotides, more
preferably at least 75 nucleotides, and most preferably 110 nucleotides. 

Sequence identity is typically measured using sequence analysis software with the default 
parameters specified therein (e.g., Sequence Analysis Software Package of the Genetics
Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue,
Madison, WI 53705). This software program matches similar sequences by assigning degrees of
homology to various substitutions, deletions, and other modifications. Conservative 
substitutions typically include substitutions within the following groups: glycine, alanine, valine,
isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine,
arginine; and phenylalanine, tyrosine. 
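
The identity and conservative-substitution criteria above can be illustrated with a short sketch. It assumes the two sequences have already been aligned and are of equal length with no gaps, which is a simplification of what the sequence analysis software named above does; the function names are hypothetical, and the substitution groups are those listed in the preceding paragraph, written in one-letter amino acid code.

```python
# Conservative substitution groups from the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    set("GAVIL"),  # glycine, alanine, valine, isoleucine, leucine
    set("DENQ"),   # aspartic acid, glutamic acid, asparagine, glutamine
    set("ST"),     # serine, threonine
    set("KR"),     # lysine, arginine
    set("FY"),     # phenylalanine, tyrosine
]


def percent_identity(a, b):
    """Percent of identical positions in two pre-aligned, ungapped sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


def is_conservative(x, y):
    """True if x and y differ but belong to the same substitution group."""
    return x != y and any(x in g and y in g for g in CONSERVATIVE_GROUPS)


def differs_only_conservatively(a, b):
    """True if every mismatch between a and b is a conservative substitution."""
    if len(a) != len(b):
        raise ValueError("sequences must be of equal length")
    return all(x == y or is_conservative(x, y) for x, y in zip(a, b))
```

A comparison window would also need to meet the minimum lengths given in the definition of "substantially identical" before the percentage is meaningful.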

By "substantially pure polypeptide" is meant a polypeptide that has been separated from
the components that naturally accompany it. Typically, the polypeptide is substantially pure
when it is at least 60%, by weight, free from the proteins and naturally-occurring organic
molecules with which it is naturally associated. Preferably, the polypeptide is a NAIP
polypeptide that is at least 75%, more preferably at least 90%, and most preferably at least 99%,
by weight, pure. A substantially pure NAIP polypeptide may be obtained, for example, by
extraction from a natural source (e.g., a fibroblast, neuronal cell, or lymphocyte), by expression of
a recombinant nucleic acid encoding a NAIP polypeptide, or by chemically synthesizing the
protein. Purity can be measured by any appropriate method, e.g., by column chromatography,
polyacrylamide gel electrophoresis, or HPLC analysis.

A protein is substantially free of naturally associated components when it is separated 
from those contaminants which accompany it in its natural state. Thus, a protein which is 
chemically synthesized or produced in a cellular system different from the cell from which it 
naturally originates will be substantially free from its naturally associated components.
Accordingly, substantially pure polypeptides include those derived from eukaryotic organisms
but synthesized in E. coli or other prokaryotes.

By "substantially pure DNA" is meant DNA that
is free of the genes which, in the naturally-occurring genome of the organism from which the
DNA of the invention is derived, flank the gene. The term therefore includes, for example, a
recombinant DNA which is incorporated into a vector; into an autonomously replicating plasmid
or virus; or into the genomic DNA of a prokaryote or eukaryote; or which exists as a separate
molecule (e.g., a cDNA or a genomic or cDNA fragment produced by PCR or restriction
endonuclease digestion) independent of other sequences. It also includes a recombinant DNA
which is part of a hybrid gene encoding additional polypeptide sequence. 

By "transformed cell" is meant a cell into which (or into an ancestor of which) has been 
introduced, by means of recombinant DNA techniques, a DNA molecule encoding (as used 
herein) a NAIP polypeptide. 

By "transgene" is meant any piece of DNA which is inserted by artifice into a cell and
becomes part of the genome of the organism which develops from that cell. Such a transgene
may include a gene which is partly or entirely heterologous (i.e., foreign) to the transgenic
organism, or may represent a gene homologous to an endogenous gene of the organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is inserted by 
artifice into a cell and becomes part of the genome of the organism which develops from that 
cell. As used herein, the transgenic organisms are generally transgenic mammals (e.g., rodents
such as rats or mice) and the DNA (transgene) is inserted by artifice into the nuclear genome. 

By "transformation" is meant any method for introducing foreign molecules into a cell. 
Lipofection, calcium phosphate precipitation, retroviral delivery, electroporation, and biolistic 
transformation are just a few of the teachings which may be used. For example, biolistic
transformation is a method for introducing foreign molecules into a cell using velocity driven
microprojectiles such as tungsten or gold particles. Such velocity-driven methods originate from
pressure bursts which include, but are not limited to, helium-driven, air-driven, and
gunpowder-driven techniques. Biolistic transformation may be applied to the transformation or transfection
of a wide variety of cell types and intact tissues including, without limitation, intracellular
organelles (e.g., mitochondria and chloroplasts), bacteria, yeast, fungi, algae, animal tissue,
and cultured cells.

By "positioned for expression" is meant that the DNA molecule is positioned adjacent to 
a DNA sequence which directs transcription and translation of the sequence (i.e., facilitates the 
production of, e.g., a NAIP polypeptide, a recombinant protein or a RNA molecule).

By "reporter gene" is meant a gene whose expression may be assayed; such genes 
include, without limitation, glucuronidase (GUS), luciferase, chloramphenicol transacetylase
(CAT), β-galactosidase, and green fluorescent protein (GFP).

By "promoter" is meant minimal sequence sufficient to direct transcription. Also 
included in the invention are those promoter elements which are sufficient to render promoter- 
dependent gene expression controllable for cell type-specific, tissue-specific or inducible by 
external signals or agents; such elements may be located in the 5' or 3' regions of the native gene.

By "operably linked" is meant that a gene and one or more regulatory sequences are
connected in such a way as to permit gene expression when the appropriate molecules (e.g.,
transcriptional activator proteins) are bound to the regulatory sequences.

By "conserved region" is meant any stretch of six or more contiguous amino acids 
exhibiting at least 30%, preferably 50%, and most preferably 70% amino acid sequence identity 
between two or more of the NAIP family members (e.g., between human NAIP and murine
NAIP). 

By "carboxy terminal amino acids of NAIP" is meant the amino acids carboxy
terminal to the three BIR domains of the NAIP gene. For example, the amino acids encoded
beyond nucleic acid 1360 of Seq. ID No. 21 are carboxy terminal.

By "detectably-labelled" is meant any means for marking and identifying the presence of 
a molecule, e.g., an oligonucleotide probe or primer, a gene or fragment thereof, or a cDNA 
molecule. Methods for detectably-labelling a molecule are well known in the art and include, 
without limitation, radioactive labelling (e.g., with an isotope such as 33P or 35S) and
nonradioactive labelling (e.g., chemiluminescent labelling, e.g., fluorescein labelling).

By "antisense," as used herein in reference to nucleic acids, is meant a nucleic acid 
sequence, regardless of length, that is complementary to the coding strand of a gene. 
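
As an illustration of this definition, an antisense sequence can be derived from a coding-strand sequence by taking its reverse complement. The following is a minimal sketch assuming an unambiguous DNA alphabet (A, C, G, T) and sequences written 5' to 3'; the function names and the example sequence are hypothetical and are not taken from the sequence listing.

```python
# Watson-Crick complement table for unambiguous DNA bases.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(coding_strand):
    """Reverse complement of a coding-strand DNA sequence, written 5' to 3'."""
    seq = coding_strand.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(coding_strand):
    """Antisense RNA corresponding to the same coding strand (T replaced by U)."""
    return antisense_dna(coding_strand).replace("T", "U")
```

For the toy input `ATGGCC`, `antisense_dna` returns `GGCCAT` and `antisense_rna` returns `GGCCAU`.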

By "purified antibody" is meant antibody which is at least 60%, by weight, free from 
proteins and naturally occurring organic molecules with which it is naturally associated. 
Preferably, the preparation is at least 75%, more preferably 90%, and most preferably at least 
99%, by weight, antibody, e.g., a NAIP specific antibody. A purified antibody may be obtained, 
for example, by affinity chromatography using recombinantly-produced protein or conserved 
motif peptides and standard techniques. 

By "specifically binds" is meant an antibody that recognizes and binds a protein but that
does not substantially recognize and bind other molecules in a sample, e.g., a biological sample,
that naturally includes protein. The preferred antibody binds to the NAIP peptide sequence of
Sequence ID No. 2 but does not bind to the NAIP sequence disclosed in PCT/CA95/00581.

Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims.

Brief Description of the Drawings

Various aspects of the invention are described with respect to the drawings wherein:

Fig. 1 shows expression of NAIP in HeLa, CHO and Rat-1 pooled stable lines and
adenovirus infected cells analysed by Western blotting (A-D) and immunofluorescence. A-B are
cells infected with adenovirus encoding NAIP-myc detected by a mouse anti-myc monoclonal
antibody or by a rabbit anti-human NAIP polyclonal antibody. C, cells infected with adenovirus
encoding NAIP detected by the NAIP polyclonal antibody. D, expression of myc-NAIP in
representative pooled cell lines by immunofluorescence detected with antibodies against myc.
E-F, Rat-1 NAIP transfectants detected by E, anti-myc and F, anti-NAIP antibodies.

Fig. 2 shows the effect of NAIP on cell death induced by serum deprivation, menadione
and TNF-α. Viability of CHO cells deprived of serum in A, adenovirus infected cells and B,
pooled transformants. C-H, cell death induced by menadione in adenovirus infected CHO (C, D)
and Rat-1 (E, F and G, H) adenovirus infected cells and pooled transformants respectively. I,
adenovirus infected and J, pooled transformants of TNF-α/cycloheximide treated HeLa cells.

Fig. 3 shows immunofluorescence analysis of human spinal cord tissue. A, Anterior
horn cells. B, Intermediolateral neurons. C, Dorsal roots. D, Ventral roots.

Fig. 4 depicts the genomic structure of PAC 125D9 from human chromosome 5q13.1.
Both strands of the 131,708 bp region shown in the figure have been sequenced and can be found as
GenBank accession U80017. NotI (N), EcoRI (E), HindIII (H) and BamHI (B) sites are indicated.
The exons of BTF2p44 (green), NAIP (red) and SMN (grey) are represented above by numbered
color boxes. The transcribed (but not translated) CCA sequence is indicated by the light green box.
The number of nucleotides which a specific region spans is as indicated, e.g. the gap between NAIP
and SMN is 15471 bp. The minimal tiling pattern of plasmid clones covering the PAC is shown
below. The letters at the beginning of each clone indicate the restriction enzymes used for preparing
the plasmid libraries, except for 1C6, 2A8 and 2E2 which are clones from the partial Sau3AI
libraries. (SstI-S). The location and orientation of eight classes of repeat sequences found using the
NIH Sequin program are depicted by color triangles. The names of the repeats represented by
different colors are shown at the top right of the figure. Promoter sequences as detected by GRAIL
(red arrow) or Prestridge (Prestridge, D. S. J. Mol. Biol. 249, 923-932 (1995)) (green arrow)
programs and CpG islands are shown as arrows or blue blocks respectively above the bar.

Fig. 5 shows the sequences obtained in 2 separate sequencings of the NAIP gene.

Fig. 6 shows a preferred NAIP cDNA sequence and the predicted NAIP polypeptide
sequence.

Fig. 7 shows a NAIP sequence including the intron-exon boundaries (Seq. ID No. 23).

Detailed Description of the Preferred Embodiment

Although the precise site and mechanism of NAIP's anti-apoptotic effect are unknown, it is
now demonstrated that NAIP is clearly involved in apoptotic pathways in mammalian cells. In
addition, immunofluorescence localization indicates that NAIP is expressed in motor, but not
sensory, neurons. These findings are in keeping with the protein acting as a negative regulator of
apoptosis, most particularly neuronal apoptosis, and, when deficient or absent, contributing to
neurodegenerative phenotypes such as SMA and ALS.

I. The NAIP gene

There are two nearly identical copies of NAIP on 5q13.1. The complete NAIP gene, shown
in Fig. 6, contains 18 exons (1 to 14, and 14a to 17) and spans an estimated 90 kb of genomic DNA.
(Other intermediate sequences obtained are shown in Figs. 5 and 7.) The NAIP coding region spans
4212 nucleotides resulting in a predicted gene product of 1404 amino acids (Seq. ID No. 22). The
total length of the NAIP gene spans 6228 nucleotides (Seq. ID No. 21) with a 395 nucleotide 5'
UTR and a 1621 nucleotide 3' UTR. The complete sequence, Sequence ID No. 2, allows one skilled
in the art to develop probes and primers for the identification of homologous sequences and for the
identification of mutations within the DNA. Both 5' and 3' regions may also prove useful as
encoding binding sites for agents which may up- or down-regulate the gene, further delineating the
NAIP pathway and function. The sequences identified as Seq. ID No. 2 and 23 are also useful for
protein expression in appropriate vectors and hosts to produce NAIP and study its function as well
as to develop antibodies. Sequencing of the PAC 125D9 154 kb, which was identified as a likely
site of the SMA gene, resulted in the identification of the NAIP sequence shown in Fig. 5, Seq. ID
No. 1. An additional coding sequence, exon 14a, has since been identified and is provided herewith.
The NAIP DNA sequence containing exon 14a appears to be a predominant gene isoform which is
not deleted or mutated in SMA patients. The techniques and primers used for the isolation and
application of exon 14a from the human fetal spinal cord cDNA libraries were as described for the
identification of the other exons and detailed in Example 4. Additional screening of cDNA libraries
combined with analysis of PAC 125D9 genomic DNA sequence has resulted in the identification of
a novel 3' end of NAIP which includes additional exon 17 sequence.
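
The lengths quoted in this section are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only a consistency check on the stated figures; the variable names are illustrative, and whether the 4212-nucleotide coding region is counted with or without a stop codon is not stated in the text.

```python
# Figures quoted in the text above.
FIVE_PRIME_UTR = 395      # nucleotides
CODING_REGION = 4212      # nucleotides
THREE_PRIME_UTR = 1621    # nucleotides
STATED_TOTAL = 6228       # nucleotides (Seq. ID No. 21)
STATED_PROTEIN = 1404     # amino acids (Seq. ID No. 22)

# 5' UTR + coding region + 3' UTR should equal the stated total length.
total_length = FIVE_PRIME_UTR + CODING_REGION + THREE_PRIME_UTR

# Three nucleotides per codon; a non-zero remainder would indicate a frame error.
codons, remainder = divmod(CODING_REGION, 3)

# Exons 1 to 14, plus 14a, plus 15 to 17.
exon_labels = [str(i) for i in range(1, 15)] + ["14a"] + [str(i) for i in range(15, 18)]
```

Here `total_length` evaluates to 6228, `codons` to 1404 with no remainder, and `exon_labels` has 18 entries, matching the text.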

II. Synthesis of NAIP

The characteristics of the cloned NAIP gene sequence may be analyzed by introducing the 
sequence into various cell types or using in vitro extracellular systems. The function of the NAIP 
may then be examined under different physiological conditions. The NAIP DNA sequence may be
manipulated in studies to understand the expression of the gene and gene product. Alternatively, cell 
lines may be produced which overexpress the gene product allowing purification of NAIP for 
biochemical characterization, large-scale production, antibody production, and patient therapy. 

For protein expression, eukaryotic and prokaryotic expression systems may be generated in
which the NAIP gene sequence is introduced into a plasmid or other vector which is then
introduced into living cells. Constructs in which the NAIP cDNA sequence containing the entire
open reading frame is inserted in the correct orientation into an expression plasmid may be used for
protein expression. Alternatively, portions of the sequence, including wild-type or mutant NAIP
sequences, may be inserted. Prokaryotic and eukaryotic expression systems allow various important
functional domains of the protein to be recovered as fusion proteins and then used for binding,
structural and functional studies and also for the generation of appropriate antibodies. If a NAIP
increases apoptosis, it may be desirable to express that protein under control of an inducible
promoter.

Typical expression vectors contain promoters that direct the synthesis of large amounts of
mRNA corresponding to the gene. They may also include sequences allowing for their autonomous
replication within the host organism, sequences that encode genetic traits that allow cells containing
the vectors to be selected, and sequences that increase the efficiency with which the mRNA is
translated. Some vectors contain selectable markers such as neomycin resistance that permit
isolation of cells by growing them under selective conditions. Stable long-term vectors may be
maintained as freely replicating entities by using regulatory elements of viruses. Cell lines may also
be produced which have integrated the vector into the genomic DNA and in this manner the gene
product is produced on a continuous basis.

Expression of foreign sequences in bacteria such as E. coli requires the insertion of the NAIP
sequence into an expression vector, usually a bacterial plasmid. This plasmid vector contains
several elements such as sequences encoding a selectable marker that assures maintenance of the
vector in the cell, a controllable transcriptional promoter (i.e., lac) which upon induction can produce
large amounts of mRNA from the cloned gene, translational control sequences and a polylinker to
simplify insertion of the gene in the correct orientation within the vector. In a simple E. coli
expression vector utilizing the lac promoter, the expression vector plasmid contains a fragment of
the E. coli chromosome containing the lac promoter and the neighboring lacZ gene. In the presence
of the lactose analog IPTG, RNA polymerase normally transcribes the lacZ gene producing lacZ
mRNA which is translated into the encoded protein, β-galactosidase. The lacZ gene can be cut out
of the expression vector with restriction enzymes and replaced by NAIP gene sequence. When this
resulting plasmid is transformed into E. coli, addition of IPTG and subsequent transcription from the
lac promoter produces NAIP mRNA, which is translated into NAIP.

Once the appropriate expression vector containing the NAIP gene is constructed it is
introduced into an appropriate E. coli strain by transformation techniques including calcium
phosphate transfection, DEAE-dextran transfection, electroporation, microinjection, protoplast
fusion and liposome-mediated transfection.

The host cell which may be transfected with the vector of this invention may be selected
from the group consisting of E. coli, Pseudomonas, Bacillus subtilis, or other bacilli, other bacteria,
yeast, fungi, insect (using baculoviral vectors for expression), mouse or other animal or human
tissue cells. Mammalian cells can also be used to express the NAIP protein using a vaccinia virus
expression system.

In vitro expression of proteins encoded by cloned DNA is also possible using the T7
late-promoter expression system. This system depends on the regulated expression of T7 RNA
polymerase which is an enzyme encoded in the DNA of bacteriophage T7. The T7 RNA
polymerase transcribes DNA beginning within a specific 23-bp promoter sequence called the T7
late promoter. Copies of the T7 late promoter are located at several sites on the T7 genome, but
none is present in E. coli chromosomal DNA. As a result, in T7 infected cells, T7 RNA polymerase
catalyzes transcription of viral genes but not of E. coli genes. In this expression system
recombinant E. coli cells are first engineered to carry the gene encoding T7 RNA polymerase next to
the lac promoter. In the presence of IPTG, these cells transcribe the T7 polymerase gene at a high
rate and synthesize abundant amounts of T7 RNA polymerase. These cells are then transformed
with plasmid vectors that carry a copy of the T7 late promoter adjacent to the cDNA encoding the
protein of interest. When IPTG is added to the
culture medium containing these transformed E. coli cells, large amounts of T7 RNA polymerase are
produced. The polymerase then binds to the T7 late promoter on the plasmid expression vectors,
catalyzing transcription of the inserted cDNA at a high rate. Since each E. coli cell contains many
copies of the expression vector, large amounts of mRNA corresponding to the cloned cDNA can be
produced in this system and the resulting protein can be radioactively labelled. Plasmid vectors
containing late promoters and the corresponding RNA polymerases from related bacteriophages
such as T3, T5 and SP6 may also be used for in vitro production of proteins from cloned DNA.
E. coli can also be used for expression by infection with M13 phage mGP1-2. E. coli vectors can also
be used with phage lambda regulatory sequences, by fusion protein vectors, by maltose-binding
protein fusions, and by glutathione-S-transferase fusion proteins.

A preferred expression system is the baculovirus system using, for example, the vector
pBacPAK9, which is available from Clontech (Palo Alto, CA). If desired, this system may be used
in conjunction with other protein expression techniques, for example, the myc tag approach
described by Evan et al. (Mol. Cell. Biol. 5:3610-3616, 1985).

Eukaryotic expression systems permit appropriate post-translational modifications to
expressed proteins. This allows for studies of the NAIP gene and gene product including
determination of proper expression and post-translational modifications for biological activity, and
identification of regulatory elements located in the 5' region of the NAIP gene and their role in tissue
regulation of protein expression. It also permits the production of large amounts of normal and
mutant proteins for isolation and purification; the use of cells expressing NAIP as a functional assay
system for antibodies generated against the protein, to test the effectiveness of pharmacological
agents, or as a component of a signal transduction system; and the study of the function of the normal
complete protein, specific portions of the protein, or of naturally occurring polymorphisms and
artificially produced mutated proteins. The NAIP DNA sequence can be altered using procedures
such as restriction enzyme digestion, DNA polymerase fill-in, exonuclease deletion, terminal
deoxynucleotide transferase extension, ligation of synthetic or cloned DNA sequences and
site-directed sequence alteration using specific oligonucleotides together with PCR.

A NAIP may be produced by a stably-transfected mammalian cell line. A number of vectors
suitable for stable transfection of mammalian cells are available to the public, e.g., see Pouwels et
al. (supra), as are methods for constructing such cell lines (see e.g., Ausubel et al. (supra)). In one
example, cDNA encoding a NAIP is cloned into an expression vector that includes the dihydrofolate
reductase (DHFR) gene. Integration of the plasmid and, therefore, integration of the
NAIP-encoding gene into the host cell chromosome is selected for by inclusion of 0.01-300 μM
methotrexate in the cell culture medium (as described, Ausubel et al., supra). This dominant
selection can be accomplished in most cell types. Recombinant protein expression can be increased
by DHFR-mediated amplification of the transfected gene.

Methods for selecting cell lines bearing gene amplifications are described in Ausubel et al.
(supra). These methods generally involve extended culture in medium containing gradually
increasing levels of methotrexate. The most commonly used DHFR-containing expression vectors
are pCVSEII-DHFR and pAdD26SV(A) (described in Ausubel et al., supra). The host cells
described above or, preferably, a DHFR-deficient CHO cell line (e.g., CHO DHFR- cells, ATCC
Accession No. CRL 9096) are among those most preferred for DHFR selection of a
stably-transfected cell line or DHFR-mediated gene amplification.

Once the recombinant protein is expressed, it is isolated by, for example, affinity
chromatography. In one example, an anti-NAIP antibody, which may be produced by the methods
described herein, can be attached to a column and used to isolate the NAIP protein. Lysis and
fractionation of NAIP-harboring cells prior to affinity chromatography may be performed by
standard methods (see e.g., Ausubel et al., supra). Once isolated, the recombinant protein can, if
desired, be purified further, e.g., by high performance liquid chromatography (HPLC; e.g., see
Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, Work and Burdon, Eds.,
Elsevier, 1980).

Polypeptides of the invention, particularly short NAIP fragments, can also be produced by
chemical synthesis (e.g., by the methods described in Solid Phase Peptide Synthesis, 2nd ed., 1984,
The Pierce Chemical Co., Rockford, IL). These general techniques of polypeptide expression and
purification can also be used to produce and isolate useful NAIP fragments or analogs, as described
herein.

Those skilled in the art of molecular biology will understand that a wide variety of
expression systems may be used to produce the recombinant protein. The precise host cell used is
not critical to the invention. The NAIP protein may be produced in a prokaryotic host (e.g., E. coli)
or in a eukaryotic host (e.g., S. cerevisiae, insect cells such as Sf21 cells, or mammalian cells such as
COS-1, NIH 3T3, or HeLa cells). These cells are publicly available, for example, from the
American Type Culture Collection, Rockville, MD (see also Ausubel et al., Current Protocols in
Molecular Biology, John Wiley & Sons, New York, NY, 1994). The method of transduction and
the choice of expression vehicle will depend on the host system selected. Transformation and
transfection methods are described, e.g., in Ausubel et al. (supra), and expression vehicles may be
chosen from those provided, e.g., in Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al.,
1985, Supp. 1987).

III. Testing for the presence of NAIP biological activity

To analyze the effect of NAIP on apoptosis in a first approach, expression plasmids alone or
encoding nearly full length NAIP or Bcl-2 (a protein which functions under normal conditions to
protect cells against apoptosis) were transfected into CHO, Rat-1 and HeLa cells followed by G418
selection. Initially, a NAIP cDNA was isolated by probing a human fetal brain cDNA library with a
genomic DNA insert of a cosmid from the constructed cosmid library, and a cDNA fragment
encoding most of the three BIR domains corresponding to the NAIP gene sequence was isolated.

IV. Cellular Distribution of NAIP 

We have looked at the distribution of NAIP using immunofluorescence of labelled 
antibodies and find NAIP is expressed in at least the following tissues: motor neurons, myocardial 
cells, liver, placenta and CNS. 

V. NAIP Fragments 

The BIR domains of NAIP appear to be both necessary and sufficient for NAIP biological
activity. Surprisingly, we have reason to believe carboxy terminal deletions of NAIP amino acids
actually enhance inhibition of apoptosis by NAIP. Deletions may be up to the end of the last NAIP
BIR domain (i.e., the third), but need not delete the entire region carboxy terminal to the third BIR
domain.

VI. NAIP Antibodies

In order to prepare polyclonal antibodies, NAIP, fragments of NAIP, or fusion proteins
containing defined portions or all of the NAIP protein can be synthesized in bacteria by expression
of corresponding DNA sequences in a suitable cloning vehicle. Fusion proteins are commonly used
as a source of antigen for producing antibodies. Two widely used expression systems for E. coli are
lacZ fusions using the pUR series of vectors and trpE fusions using the pATH vectors. The protein
can then be purified, coupled to a carrier protein and mixed with Freund's adjuvant (to help
stimulate the antigenic response by the rabbits) and injected into rabbits or other laboratory animals.
Alternatively, protein can be isolated from NAIP expressing cultured cells. Following booster
injections at bi-weekly intervals, the rabbits or other laboratory animals are then bled and the sera
isolated. The sera can be used directly or purified prior to use by various methods, including
affinity chromatography employing Protein A-Sepharose, Antigen Sepharose, or
Anti-mouse-Ig-Sepharose. The sera can then be used to probe protein extracts from tissues run on a
polyacrylamide gel to identify the NAIP protein. Alternatively, synthetic peptides can be made to
the antigenic portions of the protein and used to inoculate the animals.

In order to generate peptide for use in making NAIP-specific antibodies, a NAIP coding
sequence (i.e., amino acid fragments shown in Seq. ID Nos. 22 and 24) can be expressed as a
C-terminal fusion with glutathione S-transferase (GST; Smith et al., Gene 67:31-40, 1988). The
fusion protein can be purified on glutathione-Sepharose beads, eluted with glutathione, and cleaved
with thrombin (at the engineered cleavage site), and purified to the degree required to successfully
immunize rabbits. Primary immunizations can be carried out with Freund's complete adjuvant and
subsequent immunizations performed with Freund's incomplete adjuvant. Antibody titres are
monitored by Western blot and immunoprecipitation analyses using the thrombin-cleaved NAIP
fragment of the GST-NAIP fusion protein. Immune sera are affinity purified using
CNBr-Sepharose-coupled NAIP protein. Antiserum specificity is determined using a panel of unrelated
GST proteins (including GST-p53, Rb, HPV-16 E6, and E6-AP) and GST-trypsin (which was
generated by PCR using known sequences).

It is also understood by those skilled in the art that monoclonal NAIP antibodies may be
produced using protein obtained by culturing cells actively expressing the protein or isolated from
tissues. The cell extracts, or recombinant protein extracts, containing the NAIP protein, may, for
example, be injected in Freund's adjuvant into mice. After being injected, the mouse spleens may be
removed and resuspended in phosphate buffered saline (PBS). The spleen cells serve as a source of
lymphocytes, some of which are producing antibody of the appropriate specificity. These are then
fused with permanently growing myeloma partner cells, and the products of the fusion are plated
into a number of tissue culture wells in the presence of a selective agent such as HAT. The wells are
then screened by ELISA to identify those containing cells making binding antibody. These are then
plated and after a period of growth, these wells are again screened to identify antibody-producing cells.
Several cloning procedures are carried out until over 90% of the wells contain single clones which
are positive for antibody production. From this procedure a stable line of clones which produce the
antibody is established. The monoclonal antibody can then be purified by affinity chromatography
using Protein A Sepharose, ion-exchange chromatography, as well as variations and combinations
of these techniques. Truncated versions of monoclonal antibodies may also be produced by
recombinant methods in which plasmids are generated which express the desired monoclonal
antibody fragment(s) in a suitable host.

As an alternate or adjunct immunogen to GST fusion proteins, peptides corresponding to
relatively unique hydrophilic regions of NAIP may be generated and coupled to keyhole limpet
hemocyanin (KLH) through an introduced C-terminal lysine. Antiserum to each of these peptides is
similarly affinity purified on peptides conjugated to BSA, and specificity is tested by ELISA and
Western blotting using peptide conjugates, and by Western blotting and immunoprecipitation using
NAIP expressed as a GST fusion protein.

Alternatively, monoclonal antibodies may be prepared using the NAIP proteins described
above and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et
al., Eur. J. Immunol. 6:511, 1976; Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et al., In
Monoclonal Antibodies and T Cell Hybridomas, Elsevier, New York, NY, 1981; Ausubel et al.,
supra). Once produced, monoclonal antibodies are also tested for specific NAIP recognition by
Western blot or immunoprecipitation analysis (by the methods described in Ausubel et al., supra).

Antibodies that specifically recognize NAIP (or fragments of NAIP), such as those described
herein containing one or more BIR domains, are considered useful in the invention. They may, for
example, be used in an immunoassay to monitor NAIP expression levels or to determine the
subcellular location of a NAIP or NAIP fragment produced by a mammal. Antibodies that inhibit
NAIP described herein may be especially useful in inducing apoptosis in cells undergoing
undesirable proliferation.

Preferably, antibodies of the invention are produced using NAIP sequence that does not
reside within highly conserved regions, and that appears likely to be antigenic, as analyzed by
criteria such as those provided by the Peptide structure program (Genetics Computer Group
Sequence Analysis Package, Program Manual for the GCG Package, Version 7, 1991) using the
algorithm of Jameson and Wolf (CABIOS 4:181, 1988). These fragments can be generated by
standard techniques, e.g. by the PCR, and cloned into the pGEX expression vector (Ausubel et al.,
supra). Fusion proteins are expressed in E. coli and purified using a glutathione agarose affinity
matrix as described in Ausubel et al. (supra). In order to minimize the potential for obtaining
antisera that are non-specific, or exhibit low-affinity binding to NAIP, two or three fusions are
generated for each protein, and each fusion is injected into at least two rabbits. Antisera are raised
by injections in series, preferably including at least three booster injections.

VII. Use of NAIP Antibodies

Antibodies to NAIP may be used, as noted above, to detect NAIP or inhibit the protein. In
addition, the antibodies may be coupled to compounds for diagnostic and/or therapeutic uses such as
radionuclides for imaging and therapy and liposomes for the targeting of compounds to a specific
tissue location.

VIII. Detection of NAIP gene expression

As noted, the antibodies described above may be used to monitor NAIP protein expression.
In addition, in situ hybridization is a method which may be used to detect the expression of the
NAIP gene. In situ hybridization relies upon the hybridization of a specifically labelled nucleic acid
probe to the cellular RNA in individual cells or tissues. Therefore, it allows the identification of
mRNA within intact tissues, such as the brain. In this method, oligonucleotides or cloned
nucleotide (RNA or DNA) fragments corresponding to unique portions of the NAIP gene are used
to detect specific mRNA species, e.g., in the brain. In this method a rat is anesthetized and
transcardially perfused with cold PBS, followed by perfusion with a formaldehyde solution. The
brain or other tissue is then removed, frozen in liquid nitrogen, and cut into thin micron sections.
The sections are placed on slides and incubated in proteinase K. Following rinsing in DEP water
and ethanol, the slides are placed in prehybridization buffer. A radioactive probe corresponding to
the primer is made by nick translation and incubated with the sectioned brain tissue. After
incubation and air drying, the labelled areas are visualized by autoradiography. Dark spots on the
tissue sample indicate hybridization of the probe with NAIP mRNA, which demonstrates the
expression of the protein.

IX. Identification of Molecules that Modulate NAIP Protein Expression

NAIP cDNAs may be used to facilitate the identification of molecules that increase or
decrease NAIP expression. In one approach, candidate molecules are added, in varying
concentration, to the culture medium of cells expressing NAIP mRNA. NAIP expression is then
measured, for example, by Northern blot analysis (Ausubel et al., supra) using a NAIP cDNA, or
cDNA or RNA fragment, as a hybridization probe. The level of NAIP expression in the presence of
the candidate molecule is compared to the level of NAIP expression in the absence of the candidate
molecule, all other factors (e.g. cell type and culture conditions) being equal.
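
The comparison described above amounts to a ratio of two measurements taken under otherwise equal conditions. The following is a minimal sketch of that calculation; the function names, the two-fold cut-off, and the signal values are assumptions introduced purely for illustration and do not appear in the disclosure.

```python
def fold_change(with_candidate: float, without_candidate: float) -> float:
    """Ratio of NAIP signal with the candidate molecule to signal without it."""
    if without_candidate <= 0:
        raise ValueError("control signal must be positive")
    return with_candidate / without_candidate


def classify(fc: float, threshold: float = 2.0) -> str:
    """Label a fold change; the 2-fold threshold is an arbitrary assumption."""
    if fc >= threshold:
        return "increases"
    if fc <= 1.0 / threshold:
        return "decreases"
    return "no clear effect"


# Hypothetical band intensities (arbitrary units) from a Northern blot.
result = classify(fold_change(with_candidate=300.0, without_candidate=100.0))
```

In practice each signal would first be normalized to a loading control, and the threshold would be set from replicate variability rather than fixed in advance.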

The effect of candidate molecules on NAIP-mediated apoptosis may, instead, be measured at
the level of translation by using the general approach described above with standard protein
detection techniques, such as Western blotting or immunoprecipitation with a NAIP-specific
antibody (for example, the NAIP antibody described herein).

Compounds that modulate the level of NAIP may be purified, or substantially purified, or
may be one component of a mixture of compounds such as an extract or supernatant obtained from
cells (Ausubel et al., supra). In an assay of a mixture of compounds, NAIP expression is tested
against progressively smaller subsets of the compound pool (e.g., produced by standard purification
techniques such as HPLC or FPLC) until a single compound or minimal number of effective
compounds is demonstrated to modulate NAIP expression.

Compounds may also be screened for their ability to modulate NAIP apoptosis inhibiting
activity. In this approach, the degree of apoptosis in the presence of a candidate compound is
compared to the degree of apoptosis in its absence, under equivalent conditions. Again, the screen
may begin with a pool of candidate compounds, from which one or more useful modulator
compounds are isolated in a step-wise fashion. Apoptosis activity may be measured by any
standard assay, for example, those described herein.
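
The step-wise narrowing of a compound pool described above can be sketched as a recursive halving procedure. This is one possible reading of "progressively smaller subsets", not the method of the disclosure: the function name, the compound labels, and the stand-in assay are hypothetical, and the sketch assumes that a pool scores as active whenever it contains at least one active compound (no synergy or masking between compounds).

```python
from typing import Callable, List, Sequence


def deconvolute(pool: Sequence[str],
                is_active: Callable[[Sequence[str]], bool]) -> List[str]:
    """Split an active pool in half repeatedly until single active compounds remain."""
    if not pool or not is_active(pool):
        return []                      # inactive subsets are discarded untested further
    if len(pool) == 1:
        return [pool[0]]               # a single compound that still scores as active
    mid = len(pool) // 2
    return deconvolute(pool[:mid], is_active) + deconvolute(pool[mid:], is_active)


# Stand-in for a real expression or apoptosis assay (illustrative only).
true_modulators = {"c3", "c6"}


def mock_assay(subset: Sequence[str]) -> bool:
    return any(compound in true_modulators for compound in subset)


compounds = [f"c{i}" for i in range(8)]
found = deconvolute(compounds, mock_assay)
```

With one active compound among n, this design needs on the order of log2(n) rounds of assays rather than n individual tests, which is the practical appeal of starting from a pool.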

Another method for detecting compounds that modulate the activity of NAIPs is to screen
for compounds that interact physically with a given NAIP polypeptide. These compounds may be
detected by adapting interaction trap expression systems known in the art. These systems detect
protein interactions using a transcriptional activation assay and are generally described by Gyuris et
al. (Cell 75:791-803, 1993) and Field et al. (Nature 340:245-246, 1989), and are commercially
available from Clontech (Palo Alto, CA). In addition, PCT Publication WO 95/28497 describes an
interaction trap assay in which proteins involved in apoptosis, by virtue of their interaction with
Bcl-2, are detected. A similar method may be used to identify proteins and other compounds that
interact with NAIP.

Compounds or molecules that function as modulators of NAIP-mediated cell death may
include peptide and non-peptide molecules such as those present in cell extracts, mammalian serum
or growth medium in which mammalian cells have been cultured.

A molecule that promotes an increase in NAIP expression or NAIP activity is considered
particularly useful in the invention; such a molecule may be used, for example, as a therapeutic to 
increase cellular levels of NAIP and thereby exploit the ability of NAIP polypeptides to inhibit 
apoptosis. 

A molecule that decreases NAIP activity (e.g., by decreasing NAIP gene expression or
polypeptide activity) may be used to decrease cellular proliferation. This would be advantageous
in the treatment of neoplasms or other cell proliferative diseases.

Molecules that are found, by the methods described above, to effectively modulate NAIP
gene expression or polypeptide activity may be tested further in animal models. If they continue to
function successfully in an in vivo setting, they may be used as therapeutics to either inhibit or
enhance apoptosis, as appropriate.

X. Therapies

Therapies may be designed to circumvent or overcome an NAIP gene defect or inadequate 
NAIP gene expression, and thus moderate and possibly prevent apoptosis. The NAIP gene is 
expressed in the liver, myocardium, and placenta, as well as in the CNS. Hence, in considering 
various therapies, it is understood that such therapies may be targeted at tissue other than the brain, 
such as the liver, myocardium, and any other tissues subsequently demonstrated to express NAIP. 

a) Protein Therapy

Treatment or prevention of apoptosis can be accomplished by replacing mutant or 
insufficient NAIP protein with normal protein, by modulating the function of mutant protein, or by 
delivering normal NAIP protein to the appropriate cells. Once the biological pathway of the NAIP 
protein has been completely understood, it may also be possible to modify the pathophysiologic 
pathway (e.g., a signal transduction pathway) in which the protein participates in order to correct the 
physiological defect. 

To replace a mutant protein with normal protein, or to add protein to cells which no longer
express sufficient NAIP, it is necessary to obtain large amounts of pure NAIP from cultured cell
systems which can express the protein. Delivery of the protein to the affected tissues can then be 
accomplished using appropriate packaging or administrating systems. Alternatively, small molecule 
analogs may be used and administered to act as NAIP agonists and in this manner produce a desired 
physiological effect. Methods for finding such molecules are provided herein. 

b) Gene Therapy

Gene therapy is another potential therapeutic approach in which normal copies of the NAIP
gene are introduced into selected tissues to successfully code for normal and abundant protein in
affected cell types. The gene must be delivered to those cells in a form in which it can be taken up
and code for sufficient protein to provide effective function. Alternatively, in some mutants it may
be possible to prevent apoptosis by introducing another copy of the homologous gene bearing a
second mutation in that gene, or to alter the mutation, or to use another gene to block any negative
effect.

Transducing retroviral vectors can be used for somatic cell gene therapy, especially because
of their high efficiency of infection and stable integration and expression. The targeted cells,
however, must be able to divide, and the level of expression of normal protein should be high.
The full length NAIP gene, or portions thereof, can be cloned into a retroviral vector and driven
from its endogenous promoter, from the retroviral long terminal repeat, or from a promoter
specific for the target cell type of interest (such as neurons). Other viral vectors which can be used
include adeno-associated virus, vaccinia virus, bovine papilloma virus, or a herpes virus such as
Epstein-Barr virus.

Gene transfer could also be achieved using non-viral means requiring transfection in vitro.
This would include calcium phosphate, DEAE dextran, electroporation, and protoplast fusion.
Liposomes may also be potentially beneficial for delivery of DNA into a cell. Although these
methods are available, many of them are of lower efficiency.

Antisense based strategies can be employed to explore NAIP gene function and as a basis for
therapeutic drug design. The principle is based on the hypothesis that sequence-specific
suppression of gene expression can be achieved by intracellular hybridization between mRNA and a
complementary antisense species. The formation of a hybrid RNA duplex may then interfere with
the processing, transport, translation and/or stability of the target NAIP mRNA. Antisense strategies
may use a variety of approaches including the use of antisense oligonucleotides, injection of
antisense RNA and transfection of antisense RNA expression vectors. Antisense effects can also be
induced by control (sense) sequences; however, the extent of phenotypic changes is highly
variable. Phenotypic effects induced by antisense sequences are assessed by changes in criteria such
as protein levels, protein activity measurement, and target mRNA levels.
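The hybridization principle described above can be illustrated with a minimal sketch, assuming a DNA oligonucleotide is wanted against a short stretch of target mRNA. The function name `antisense_oligo` and the nine-base example sequence are hypothetical placeholders and are not taken from the NAIP sequence; the code only applies Watson-Crick base pairing and does not address oligonucleotide chemistry, length selection, or target accessibility.

```python
# Watson-Crick partner (as a DNA base) for each RNA base of the target mRNA.
_PAIR = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_oligo(mrna_segment: str) -> str:
    """Return the antisense DNA oligonucleotide (written 5'->3') that is
    complementary to an mRNA segment (also written 5'->3')."""
    seq = mrna_segment.upper()
    unknown = set(seq) - set(_PAIR)
    if unknown:
        raise ValueError(f"unexpected bases in mRNA segment: {sorted(unknown)}")
    # Complement each base and reverse, so the result also reads 5'->3'.
    return "".join(_PAIR[base] for base in reversed(seq))


# Hypothetical target segment (placeholder, not a NAIP sequence).
example_target = "AUGGCCACU"
print(antisense_oligo(example_target))  # AGTGGCCAT
```

A matching sense-strand control, as mentioned in the text, would simply be the DNA form of the target segment itself.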

Transplantation of normal genes into the affected cells of a patient can also be a useful
therapy. In this procedure, normal NAIP is transferred into a cultivatable cell type, either
exogenously or endogenously to the patient. These cells are then injected serologically into the
targeted tissue(s).

Retroviral vectors, adenoviral vectors, adeno-associated viral vectors, or other viral vectors,
with the appropriate tropism for cells likely to be involved in apoptosis (for example, epithelial
cells) may be used as a gene transfer delivery system for a therapeutic NAIP gene construct.
Numerous vectors useful for this purpose are generally known (Miller, Human Gene Therapy 1:5-14,
1990; Friedman, Science 244:1275-1281, 1989; Eglitis and Anderson, BioTechniques 6:608-614,
1988; Tolstoshev and Anderson, Current Opinion in Biotechnology 1:55-61, 1990; Sharp, The
Lancet 337:1277-1278, 1991; Cornetta et al., Nucleic Acid Research and Molecular Biology
36:311-322, 1987; Anderson, Science 226:401-409, 1984; Moen, Blood Cells 17:407-416, 1991;
Miller et al., Biotechniques 7:980-990, 1989; Le Gal La Salle et al., Science 259:988-990, 1993;
and Johnson, Chest 107:77S-83S, 1995). Retroviral vectors are particularly well developed and
have been used in clinical settings (Rosenberg et al., N. Engl. J. Med. 323:370, 1990; Anderson et
al., U.S. Patent No. 5,399,346). Non-viral approaches may also be employed for the introduction of
therapeutic DNA into cells otherwise predicted to undergo apoptosis. For example, NAIP may be
introduced into a neuron or a T cell by lipofection (Felgner et al., Proc. Natl. Acad. Sci. USA
84:7413, 1987; Ono et al., Neurosci. Lett. 117:259, 1990; Brigham et al., Am. J. Med. Sci. 298:278,
1989; Straubinger et al., Meth. Enz. 101:512, 1983), asialoorosomucoid-polylysine conjugation (Wu et
al., J. Biol. Chem. 263:14621, 1988; Wu et al., J. Biol. Chem. 264:16985, 1989); or, less
preferably, microinjection under surgical conditions (Wolff et al., Science 247:1465, 1990).

For any of the methods of application described above, the therapeutic NAIP DNA construct
is preferably applied to the site of the predicted apoptosis event (for example, by injection).
However, it may also be applied to tissue in the vicinity of the predicted apoptosis event or to a
blood vessel supplying the cells predicted to undergo apoptosis.

In the constructs described, NAIP cDNA expression can be directed from any suitable
promoter (e.g., the human cytomegalovirus (CMV), simian virus 40 (SV40), or metallothionein
promoters), and regulated by any appropriate mammalian regulatory element. For example, if
desired, enhancers known to preferentially direct gene expression in neural cells, T cells, or B cells
may be used to direct NAIP expression. The enhancers used could include, without limitation,
those that are characterized as tissue- or cell-specific in their expression. Alternatively, if a NAIP
genomic clone is used as a therapeutic construct (for example, following its isolation by
hybridization with the NAIP cDNA described above), regulation may be mediated by the cognate
regulatory sequences or, if desired, by regulatory sequences derived from a heterologous source,
including any of the promoters or regulatory elements described above.

Less preferably, NAIP gene therapy is accomplished by direct administration of the NAIP
mRNA or antisense NAIP mRNA to a cell that is expected to undergo apoptosis. The mRNA may
be produced and isolated by any standard technique, but is most readily produced by in vitro
transcription using a NAIP cDNA under the control of a high efficiency promoter (e.g., the
T7 promoter). Administration of NAIP mRNA or antisense NAIP mRNA to cells can be carried out by
any of the methods for direct nucleic acid administration described above.

Ideally, the production of NAIP protein by any gene therapy approach will result in cellular
levels of NAIP that are at least equivalent to the normal, cellular level of NAIP in an unaffected cell.
Treatment by any NAIP-mediated gene therapy approach may be combined with more traditional
therapies.

Another therapeutic approach within the invention involves administration of recombinant
NAIP protein, either directly to the site of a predicted apoptosis event (for example, by injection) or
systemically (for example, by any conventional recombinant protein administration technique). The
dosage of NAIP depends on a number of factors, including the size and health of the individual
patient, but, generally, between 0.1 mg and 100 mg inclusive are administered per day to an adult
in any pharmaceutically acceptable formulation.

XI. Administration of NAIP Polypeptides, NAIP Genes, or Modulators of NAIP Synthesis or Function

A NAIP protein, gene, or modulator may be administered within a pharmaceutically- 
acceptable diluent, carrier, or excipient, in unit dosage form. Conventional pharmaceutical practice 
may be employed to provide suitable formulations or compositions to administer NAIP to patients 
suffering from a disease that is caused by excessive apoptosis. Administration may begin before the 
patient is symptomatic. Any appropriate route of administration may be employed, for example, 
administration may be parenteral, intravenous, intraarterial, subcutaneous, intramuscular, 
intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, intraspinal, intracisternal,
intraperitoneal, intranasal, aerosol, by suppositories, or oral administration. Therapeutic 
formulations may be in the form of liquid solutions or suspensions; for oral administration, 
formulations may be in the form of tablets or capsules; and for intranasal formulations, in the form 
of powders, nasal drops, or aerosols. 

Methods well known in the art for making formulations are found, for example, in
"Remington's Pharmaceutical Sciences." Formulations for parenteral administration may, for
example, contain excipients, sterile water, or saline, polyalkylene glycols such as polyethylene
glycol, oils of vegetable origin, or hydrogenated naphthalenes. Biocompatible, biodegradable lactide
polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers may be
used to control the release of the compounds. Other potentially useful parenteral delivery systems
for NAIP modulatory compounds include ethylene-vinyl acetate copolymer particles, osmotic
pumps, implantable infusion systems, and liposomes. Formulations for inhalation may contain
excipients, for example, lactose, or may be aqueous solutions containing, for example,
polyoxyethylene-9-lauryl ether, glycocholate and deoxycholate, or may be oily solutions for
administration in the form of nasal drops, or as a gel.

If desired, treatment with a NAIP protein, gene, or modulatory compound may be combined
with more traditional therapies for the disease such as surgery, steroid therapy, or chemotherapy for
autoimmune disease; antiviral therapy for AIDS; and tissue plasminogen activator (TPA) for
ischemic injury.

XII. Detection of Conditions Involving Altered Apoptosis

NAIP polypeptides and nucleic acid sequences find diagnostic use in the detection or
monitoring of conditions involving aberrant levels of apoptosis. For example, decreased expression
of NAIP may be correlated with enhanced apoptosis in humans (see XII, below). Accordingly, a
decrease or increase in the level of NAIP production may provide an indication of a deleterious
condition. Levels of NAIP expression may be assayed by any standard technique. For example,
NAIP expression in a biological sample (e.g., a biopsy) may be monitored by standard Northern blot
analysis or may be aided by PCR (see, e.g., Ausubel et al., supra; PCR Technology: Principles and
Applications for DNA Amplification, H.A. Ehrlich, Ed., Stockton Press, NY; Yap et al., Nucl. Acids
Res. 19:4294, 1991).

Alternatively, a biological sample obtained from a patient may be analyzed for one or more
mutations in the NAIP sequences using a mismatch detection approach. Generally, these techniques
involve PCR amplification of nucleic acid from the patient sample, followed by identification of the
mutation (i.e., mismatch) by either altered hybridization, aberrant electrophoretic gel migration,
binding or cleavage mediated by mismatch binding proteins, or direct nucleic acid sequencing. Any
of these techniques may be used to facilitate mutant NAIP detection, and each is well known in the
art; examples of particular techniques are described, without limitation, in Orita et al., Proc. Natl.
Acad. Sci. USA 86:2766-2770, 1989; Sheffield et al., Proc. Natl. Acad. Sci. USA 86:232-236,
1989.
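For the direct-sequencing route mentioned above, the comparison step can be sketched as follows. This is a minimal illustration only: it assumes the patient read and the wild-type reference have already been aligned to equal length, it reports substitutions only (not insertions or deletions), and the function name `point_mismatches` and the example sequences are hypothetical rather than drawn from the NAIP sequence.

```python
def point_mismatches(reference: str, sample: str):
    """List (position, reference_base, sample_base) for every substitution
    between two pre-aligned sequences of equal length (positions are 1-based)."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to the same length")
    pairs = zip(reference.upper(), sample.upper())
    return [(i + 1, r, s) for i, (r, s) in enumerate(pairs) if r != s]


# Hypothetical reference and patient reads (placeholders only).
print(point_mismatches("ATGGCC", "ATGACC"))  # [(4, 'G', 'A')]
```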

In yet another approach, immunoassays are used to detect or monitor NAIP protein in a
biological sample. NAIP specific polyclonal or monoclonal antibodies (produced as described
above) may be used in any standard immunoassay format (e.g., ELISA, Western blot, or RIA) to
measure NAIP polypeptide levels. These levels would be compared to wild-type NAIP levels, with
a decrease in NAIP production indicating a condition involving increased apoptosis. Examples of
immunoassays are described, e.g., in Ausubel et al., supra. Immunohistochemical techniques may
also be utilized for NAIP detection. For example, a tissue sample may be obtained from a patient,
sectioned, and stained for the presence of NAIP using an anti-NAIP antibody and any standard
detection system (e.g., one which includes a secondary antibody conjugated to horseradish
peroxidase). General guidance regarding such techniques can be found in, e.g., Bancroft and
Stevens (Theory and Practice of Histological Techniques, Churchill Livingstone, 1982) and Ausubel
et al. (supra).

In one preferred example, a combined diagnostic method may be employed that begins with
an evaluation of NAIP protein production (for example, by immunological techniques or the protein
truncation test (Hogervorst et al., Nature Genetics 10:208-212, 1995)) and also includes a nucleic
acid-based detection technique designed to identify more subtle NAIP mutations (for example, point
mutations). As described above, a number of mismatch detection assays are available to those
skilled in the art, and any preferred technique may be used. Mutations in NAIP may be detected
that either result in loss of NAIP expression or loss of NAIP biological activity. In a variation of
this combined diagnostic method, NAIP biological activity is measured as anti-apoptotic activity
using any appropriate apoptosis assay system (for example, those described herein).

Mismatch detection assays also provide an opportunity to diagnose a NAIP-mediated
predisposition to diseases caused by inappropriate apoptosis. For example, a patient heterozygous
for a NAIP mutation may show no clinical symptoms and yet possess a higher than normal
probability of developing one or more types of neurodegenerative or myelodysplastic disease, or of
having severe sequelae to an ischemic event. Given this diagnosis, a patient may take precautions to
minimize their exposure to adverse environmental factors (for example, UV exposure or chemical
mutagens) and to carefully monitor their medical condition (for example, through frequent physical
examinations). This type of NAIP diagnostic approach may also be used to detect NAIP mutations
in prenatal screens. The NAIP diagnostic assays described above may be carried out using any
biological sample (for example, any biopsy sample or other tissue) in which NAIP is normally
expressed. Identification of a mutant NAIP gene may also be assayed using these sources for test
samples.

Alternatively, a NAIP mutation, particularly as part of a diagnosis for predisposition to
NAIP-associated degenerative disease, may be tested using a DNA sample from any cell, for
example, by mismatch detection techniques. Preferably, the DNA sample is subjected to PCR
amplification prior to analysis.

XIII. Preventative Anti-Apoptotic Therapy

In a patient diagnosed to be heterozygous for a NAIP mutation or to be susceptible to NAIP
mutations (even if those mutations do not yet result in alteration or loss of NAIP biological
activity), or a patient diagnosed with a degenerative disease (e.g., motor neuron degenerative
diseases such as SMA or ALS), or diagnosed as HIV positive, any of the above therapies
may be administered before the occurrence of the disease phenotype. For example, the therapies
may be provided to a patient who is HIV positive but does not yet show a diminished T cell count or
other overt signs of AIDS. In particular, compounds shown to increase NAIP expression or NAIP
biological activity may be administered by any standard dosage and route of administration (see
above). Alternatively, gene therapy using a NAIP expression construct may be undertaken to
reverse or prevent the cell defect prior to the development of the degenerative disease.

The methods of the instant invention may be used to reduce or diagnose the disorders
described herein in any mammal, for example, humans, domestic pets, or livestock. Where a non-
human mammal is treated or diagnosed, the NAIP polypeptide, nucleic acid, or antibody employed
is preferably specific for that species.

XV. Identification of Additional NAIP Genes

Standard techniques, such as the polymerase chain reaction (PCR) and DNA hybridization,
may be used to clone additional NAIP homologues in other species. Southern blots of murine
genomic DNA hybridized at low stringency with probes specific for human NAIP reveal bands that
correspond to NAIP and/or related family members. Thus, additional NAIP sequences may be
readily identified using low stringency hybridization. Murine and human NAIP-specific
primers may be used to clone additional genes by RT-PCR.

XVI. Characterization of NAIP Activity and Intracellular Localization Studies

The ability of NAIP to modulate apoptosis can be defined in in vitro systems in which
alterations of apoptosis can be detected. Mammalian expression constructs carrying NAIP cDNAs,
which are either full-length or truncated, can be introduced into cell lines such as CHO, NIH 3T3,
HL60, Rat-1, or Jurkat cells. In addition, SF21 insect cells may be used, in which case the NAIP
gene is preferentially expressed using an insect heat shock promoter. Following transfection,
apoptosis can be induced by standard methods, which include serum withdrawal, or application of
staurosporine, menadione (which induces apoptosis via free radical formation), or anti-Fas
antibodies. As a control, cells are cultured under the same conditions as those induced to undergo
apoptosis, but either not transfected, or transfected with a vector that lacks a NAIP insert. The
ability of each NAIP construct to inhibit apoptosis upon expression can be quantified by calculating
the survival index of the cells, i.e., the ratio of surviving transfected cells to surviving control cells.
These experiments can confirm the presence of apoptosis inhibiting activity and, as discussed
below, can also be used to determine the functional region(s) of a NAIP. These assays may also be
performed in combination with the application of additional compounds in order to identify
compounds that modulate apoptosis via NAIP expression.
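The survival index defined above is a simple ratio, and a short sketch may make the calculation concrete. The function name `survival_index`, the use of replicate means, and the example counts are assumptions made for illustration; the text specifies only the ratio of surviving transfected cells to surviving control cells.

```python
from statistics import mean


def survival_index(transfected_counts, control_counts):
    """Ratio of surviving NAIP-transfected cells to surviving control cells.

    Each argument is a sequence of replicate counts of surviving cells
    measured after the apoptotic trigger; replicate means are compared.
    """
    if not transfected_counts or not control_counts:
        raise ValueError("at least one count is required for each group")
    control_mean = mean(control_counts)
    if control_mean == 0:
        raise ValueError("no surviving control cells; index is undefined")
    return mean(transfected_counts) / control_mean


# Hypothetical replicate counts (placeholders, not data from this document).
print(survival_index([60, 66, 54], [30, 28, 32]))  # 2.0
```

An index above 1 would indicate that more transfected cells than control cells survived the trigger; an index near 1 would indicate no detectable protection.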

XVII. Examples of Additional Apoptosis Assays

Specific examples of apoptosis assays are also provided in the following references. Assays
for apoptosis in lymphocytes are disclosed by: Li et al., "Induction of apoptosis in uninfected
lymphocytes by HIV-1 Tat protein", Science 268:429-431, 1995; Gibellini et al., "Tat-expressing
Jurkat cells show an increased resistance to different apoptotic stimuli, including acute human
immunodeficiency virus-type 1 (HIV-1) infection", Br. J. Haematol. 89:24-33, 1995; Martin et al.,
"HIV-1 infection of human CD4+ T cells in vitro. Differential induction of apoptosis in these cells",
J. Immunol. 152:330-42, 1994; Terai et al., "Apoptosis as a mechanism of cell death in cultured
T lymphoblasts acutely infected with HIV-1", J. Clin. Invest. 87:1710-5, 1991; Dhein et al.,
"Autocrine T-cell suicide mediated by APO-1/(Fas/CD95)", Nature 373:438-441, 1995; Katsikis et
al., "Fas antigen stimulation induces marked apoptosis of T lymphocytes in human
immunodeficiency virus-infected individuals", J. Exp. Med. 181:2029-2036, 1995; Westendorp et
al., "Sensitization of T cells to CD95-mediated apoptosis by HIV-1 Tat and gp120", Nature 375:497,
1995; DeRossi et al., Virology 198:234-44, 1994.

Assays for apoptosis in fibroblasts are disclosed by: Vossbeck et al., "Direct transforming
activity of TGF-beta on rat fibroblasts", Int. J. Cancer 61:92-97, 1995; Goruppi et al., "Dissection of
c-myc domains involved in S phase induction of NIH3T3 fibroblasts", Oncogene 9:1537-44, 1994;
Fernandez et al., "Differential sensitivity of normal and Ha-ras transformed C3H mouse embryo
fibroblasts to tumor necrosis factor: induction of bcl-2, c-myc, and manganese superoxide dismutase
in resistant cells", Oncogene 9:2009-17, 1994; Harrington et al., "c-Myc-induced apoptosis in
fibroblasts is inhibited by specific cytokines", EMBO J. 13:3286-3295, 1994; Itoh et al., "A novel
protein domain required for apoptosis. Mutational analysis of human Fas antigen", J. Biol. Chem.
268:10932-7, 1993.

Assays for apoptosis in neuronal cells are disclosed by: Melino et al., "Tissue
transglutaminase and apoptosis: sense and antisense transfection studies with human neuroblastoma
cells", Mol. Cell Biol. 14:6584-6596, 1994; Rosenbaum et al., "Evidence for hypoxia-induced,
programmed cell death of cultured neurons", Ann. Neurol. 36:864-870, 1994; Sato et al., "Neuronal
differentiation of PC12 cells as a result of prevention of cell death by bcl-2", J. Neurobiol. 25:1227-
1234, 1994; Ferrari et al., "N-acetylcysteine D- and L-stereoisomers prevents apoptotic death of
neuronal cells", J. Neurosci. 15:2857-2866, 1995; Talley et al., "Tumor necrosis factor alpha-
induced apoptosis in human neuronal cells: protection by the antioxidant N-acetylcysteine and the
genes bcl-2 and crmA", Mol. Cell. Biol. 15:2359-2366, 1995; Walkinshaw et
al., "Induction of apoptosis in catecholaminergic PC12 cells by L-DOPA. Implications for the
treatment of Parkinson's disease", J. Clin. Invest. 95:2458-2464, 1995.

Assays for apoptosis in insect cells are disclosed by: Clem et al., "Prevention of apoptosis
by a baculovirus gene during infection of insect cells", Science 254:1388-90, 1991; Crook et al.,
"An apoptosis-inhibiting baculovirus gene with a zinc finger-like motif", J. Virol. 67:2568-74,
1993; Rabizadeh et al., "Expression of the baculovirus p35 gene inhibits mammalian neural cell
death", J. Neurochem. 61:2318-21, 1993; Birnbaum et al., "An apoptosis inhibiting gene from a
nuclear polyhedrosis virus encoding a polypeptide with Cys/His sequence motifs", J. Virol.
68:2521-8, 1994; Clem et al., Mol. Cell Biol. 14:5212-5222, 1994.

WO 97/26331 PCT/IB97/00142

XVIII. Construction of a Transgenic Animal

Characterization of NAIP genes provides information that is necessary for a NAIP knockout
animal model to be developed by homologous recombination. Preferably, the model is a
mammalian animal, most preferably a mouse. Similarly, an animal model of NAIP overproduction
may be generated by integrating one or more NAIP sequences into the genome, according to
standard transgenic techniques.

A replacement-type targeting vector, which would be used to create a knockout model, can
be constructed using an isogenic genomic clone, for example, from a mouse strain such as 129/Sv
(Stratagene Inc., La Jolla, CA). The targeting vector will be introduced into a suitably-derived line
of embryonic stem (ES) cells by electroporation to generate ES cell lines that carry a profoundly
truncated form of a NAIP. To generate chimeric founder mice, the targeted cell lines will be
injected into a mouse blastula stage embryo. Heterozygous offspring will be interbred to
homozygosity. Knockout mice would provide the means, in vivo, to screen for therapeutic
compounds that modulate apoptosis via an NAIP-dependent pathway. Making such mice may
require use of loxP sites due to the multiple copies of NAIP on the chromosome (see Sauer and
Henderson, Nucleic Acids Res. 17:147-61 (1989)).

Examples 

The examples are meant to illustrate, not limit, the invention.

Example 1 Expression of NAIP in Rat-1, CHO and HeLa pooled stable lines and adenovirus
infected cells analysed by Western blotting and immunofluorescence.






To generate a nearly 3.7 kb NAIP construct tagged with the myc epitope, (i) MTG-SP3.7, a 2.5
kb Bsu36I/SalI fragment of NAIP cloned into Bluescript, and (ii) Bsu36I/XhoI cut MTG-SE1.7, the
expression vector pcDNA3 containing a 300 bp myc epitope and a 1.7 kb fragment of NAIP, were
ligated. HeLa, CHO and Rat-1 cells were transfected by lipofection (Gibco BRL) with 8 µg DNA
and G418 resistant transformants were selected by maintaining the cells in 250 µg/ml, 400 µg/ml
and 800 µg/ml G418, respectively. All cells were maintained in Eagle's medium containing 10% fetal
calf serum. For construction of the adenovirus, a 3.7 kb BamHI fragment of NAIP was cloned into
the SwaI site of the adenovirus expression cosmid pAdex1CAwt. Production of vectors,
purification by double cesium chloride gradient and titer determination were as described in
Rosenfeld, M.A. et al. 1992, and Graham, F.L. and Van Der Eb, A. 1973.

Western blot analysis was performed using mouse anti-human myc monoclonal antibody
(Ellison, M.J. and Hochstrasser, M.J. 1991) or rabbit anti-human NAIP (E1.0) polyclonal antibody.
For NAIP antibody production, rabbits were immunized with purified bacterially produced fusion
protein in complete Freund's adjuvant. Serum was pre-cleared with GST protein and anti-NAIP
immunoglobulin purified with immobilized GST-NAIP fusion proteins.

For immunofluorescence, cells were grown on glass slides, fixed with formaldehyde for 10
minutes, incubated with anti-NAIP (1:200) or anti-myc (t JO) in PBS, 0.3% Triton X-100™ for 1
hour followed by incubation with secondary antisera, FITC-labelled donkey anti-rabbit
immunoglobulin (Amersham), biotinylated goat anti-mouse immunoglobulin (Amersham) and
streptavidin Texas-Red™ (Amersham).

Example 2 The Effect of NAIP on Cell Death Induced by Serum Deprivation, Menadione and
TNF-α.

For each assay, cells were plated at 5 x 10^4/ml in triplicate. CHO or Rat-1 cells were treated
with menadione for 1.5 hours, washed 5 times in PBS and maintained in normal media. For serum
deprivation assays, cells were washed 5 times in PBS and maintained in media with 0% fetal calf
serum. HeLa cells were treated with 20 units/ml TNF-α in combination with 30 µg/ml
cycloheximide for 17 hours. Apoptosis was assayed for each trigger by propidium iodide staining.
Adenovirus infected cells were subjected to triggers 36 hours post infection. LacZ expression was
confirmed histochemically by 5-bromo-4-chloro-3-indolyl-β-D-galactoside (X-gal) as described in
Ellison, M.J. and Hochstrasser, M.J. 1991. Transcription of PIAN was determined by in situ
hybridization using the DIG labelled sense oligonucleotide following the manufacturer's protocol
(Boehringer Mannheim). The human Bcl-2 clone pB4 (ATCC) was digested with EcoRI and
ligated into the EcoRI site of pcDNA3.

For adenovirus assays, an adenovirus encoding LacZ, antisense NAIP (PIAN) or vector alone
with no insert were utilized as controls. Bcl-2 was utilized as a positive control and pcDNA alone
as a negative control in cell line assays. Cell viability was determined by trypan blue exclusion.
Data are presented as averages of three independently derived transfected pools or infections.
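The viability readout and the averaging over three pools can be sketched as below. This is an illustrative calculation only: it assumes the usual reading of trypan blue exclusion, in which unstained cells are scored as viable, and the function names, the reporting of a standard deviation, and the example counts are hypothetical and are not results from this document.

```python
from statistics import mean, stdev


def percent_viable(unstained: int, stained: int) -> float:
    """Percent viability from a trypan blue count (unstained cells are viable)."""
    total = unstained + stained
    if total <= 0:
        raise ValueError("no cells were counted")
    return 100.0 * unstained / total


def summarize_pools(pools):
    """Average percent viability over independent pools.

    `pools` is a sequence of (unstained, stained) counts, one pair per pool.
    Returns (mean, sample standard deviation); the deviation is 0.0 for one pool.
    """
    values = [percent_viable(u, s) for u, s in pools]
    spread = stdev(values) if len(values) > 1 else 0.0
    return mean(values), spread


# Hypothetical counts for three independently derived pools (placeholders).
average, spread = summarize_pools([(80, 20), (90, 10), (70, 30)])
print(average, spread)
```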

Example 3 Immunofluorescence Analysis of Human Spinal Cord Tissue.

Human tissues were obtained at autopsy from a 2 month old infant that died of non-
neurological causes and stored at -80°C. 14 µm cryostat sections were fixed in formaldehyde for
20 minutes, rinsed in PBS and incubated in blocking solution (2% horse serum, 2% casein, 2% BSA
in PBS) for 15 minutes prior to overnight incubation with anti-NAIP antisera diluted in this
blocking solution. CY-3 labelled donkey anti-rabbit immunoglobulin (Sigma) was utilized as
secondary antisera.

Example 4 Isolating and cloning the NAIP gene 
PAC Contig Array 

The 40G1 CATT subloci demonstrated linkage disequilibrium and therefore a PAC
contiguous array containing the CATT region was constructed. This PAC contig array comprised 9
clones and extended approximately 400 kb. Genetic analysis combined with the physical mapping
data indicated that the 40G1 CATT sublocus marker which showed the greatest disequilibrium with
SMA was duplicated and was localized at the extreme centromeric end of the critical SMA interval.
Consequently the 154 kb PAC clone 125D9, which contained within 10 kb of its centromeric end the
SMA interval defining CMS allele 9 and extended telomerically to incorporate the 40G1 CATT
sublocus, was chosen for further examination.

Two genomic libraries were constructed by performing complete and partial (average insert
size 5 kb) Sau3AI digestions of PAC 125D9 and cloning the restricted products into BamHI digested
Bluescript plasmids. Genomic sequencing was conducted on both termini of 200 clones from the 5
kb insert partial Sau3AI library in the manner of Chen et al. (1993), permitting the construction of
contiguous and overlapping genomic clones covering most of the PAC. This proved instrumental in
the elucidation of the neuronal apoptosis inhibitor protein gene structure.

PAC 125D9 is cleaved into 30 kb centromeric and 125 kb telomeric fragments by a NotI site
(which was later shown to bisect exon 7 of the PAC 125D9 at the beginning of the apoptosis
inhibitor domain). The NotI PAC fragments were isolated by preparative PFGE and used separately
to probe fetal brain cDNA libraries. Physical mapping and sequencing of the NotI site region was
also undertaken to assay for the presence of a CpG island, an approach which rapidly detected
coding sequences. The PAC 125D9 was also used as a template in an exon trapping system,
resulting in the identification of the exons contained in the neuronal apoptosis inhibitor protein
gene.

The multipronged approach, in addition to the presence of transcripts identified previously
by hybridization with clones from the cosmid array (such as GA1 and L7), resulted in the rapid
identification of six cDNA clones contained in the neuronal apoptosis inhibitor protein gene. The
clones were arranged, where possible, into overlapping arrays. Chimerism was excluded on a
number of occasions by detection of co-linearity of the cDNA clone termini with sequences from
clones derived from the PAC 125D9 partial Sau3AI genomic library.

Cloning of Neuronal Apoptosis Inhibitor Protein Gene 

A human fetal spinal cord cDNA library was probed with the entire genomic DNA insert of
cosmid 250B6 containing one of the 5 CATT subloci. This resulted in the detection of a 2.2 kb
transcript referred to as GA1. Further probings of fetal brain libraries with the contiguous cosmid
inserts (cosmids 40G1) as well as single copy subclones isolated from such cosmids were
undertaken. A number of transcripts were obtained including one termed L7. No coding region was
detected for L7, probably because a substantial portion of the clone contained unprocessed
heteronuclear RNA. However, L7 was later found to comprise part of what is
believed to be the neuronal apoptosis inhibitor protein gene. Similarly, the GA1 transcript
ultimately proved to be exon 13 of the neuronal apoptosis inhibitor protein. Since GA1 was found
to contain exons indicating that it was an expressed gene, it was of particular interest. The GA1
transcript, which was contained within the PAC clone 125D9, was subsequently extended by further
probing in cDNA libraries.

The remaining gaps in the cDNA were completed and the final 3' extension was achieved by
probing a fetal brain library with two trapped exons. A physical map of the cDNA with overlapping
clones was prepared. The entire cDNA sequence is shown in Table 1 and contains 18 exons (1 to
14a and 14 to 17). The amino acid sequence starts with methionine, which corresponds to the
nucleotide triplet ATG.

DNA Manipulation and Analysis

Four genomic libraries containing PAC 125D9 insert were constructed by BamHI,
BamHI/NotI, total and partial Sau3AI (selected for 5 kb insert size) digestions of the PAC genomic
DNA insert and subcloned into Bluescript vector. Sequencing of approximately 400 bp of both
termini of 200 five kb clones from the partial Sau3AI digestion library in the manner of Chen et al.
(1993) was undertaken.

Coding sequences from the PACs were isolated by the exon amplification procedure as
described by Church et al. (1994). PACs were digested with BamHI or BamHI and BglII and
subcloned into pSPL3. Pooled clones of each PAC were transfected into COS-1 cells. After a 24 h
transfection, total RNA was extracted. Exons were cloned into pAMP10 (Gibco BRL) and
sequenced utilizing primer SD2 (GTG AAC TGC ACT GTG ACA AGC TGC).
DNA sequencing was conducted on an ABI 373A automated DNA sequencer. Two
commercial human fetal brain cDNA libraries in lambda gt (Stratagene) and lambda ZAP
(Clontech) were used for candidate transcript isolation. The Northern blot was commercially
acquired (Clontech) and probing was performed using standard methodology.

In general, primers used in the paper for PCR were selected for Tms of 60°C and can be used
with the following conditions: 30 cycles of 94°C, 60 s; 60°C, 60 s; 72°C, 90 s. PCR primer mappings
are as referred to in the figure legends and text. Primer sequences are as follows:
1258 ATg CTT ggA TCT CTA gAA Tgg - Sequence ID No. 3
1285 AgC AAA gAC ATg Tgg Cgg AA - Sequence ID No. 4
1343 CCA gCT CCT AgA gAA AgA Agg A - Sequence ID No. 5
1844 gAA CTA Cgg CTg gAC TCT TTT - Sequence ID No. 6
1863 CTC TCA gCC TgC TCT TCA gAT - Sequence ID No. 7
1864 AAA gCC TCT gAC gAg Agg ATC - Sequence ID No. 8
1884 CgA CTg CCT gTT CAT CTA CgA - Sequence ID No. 9
1886 TTT gTT CTC CAg CCA CAT ACT - Sequence ID No. 10
1887 CAT TTg gCA TgT TCC TTC CAA g - Sequence ID No. 11
1893 gTA gAT gAA TAC TgA TgT TTC ATA ATT - Sequence ID No. 12
1910 TgC CAC TgC CAg gCA ATC TAA - Sequence ID No. 13
1919 TAA ACA ggA CAC ggT ACA gTg - Sequence ID No. 14
1923 CAT gTT TTA AgT CTC ggT gCT CTg - Sequence ID No. 15
1926 TTA gCC AgA TgT gTT ggC ACA Tg - Sequence ID No. 16
1927 gAT TCT ATg TgA TAg gCA gCC A - Sequence ID No. 17
1933 gCC ACT gCT CCC gAT ggA TTA - Sequence ID No. 18
1974 gCT CTC AgC TgC TCA TTC AgA T - Sequence ID No. 19
1979 ACA AAg TTC ACC ACg gCT CTg - Sequence ID No. 20
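
The text states that primers were selected for a Tm of 60°C but does not say how Tm was estimated. As a rough illustration only, the sketch below applies the Wallace rule (2°C per A/T, 4°C per G/C), which is an assumption on our part and not necessarily the method used; it also totals the hold time of the stated cycling program, ignoring ramp times. Function names are hypothetical.

```python
def clean(primer):
    """Strip the triplet spacing and mixed case used in the primer list."""
    return "".join(primer.split()).upper()


def wallace_tm(primer):
    """Wallace-rule estimate: Tm = 2*(A+T) + 4*(G+C), in degrees Celsius."""
    s = clean(primer)
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


def cycling_minutes(cycles=30, step_seconds=(60, 60, 90)):
    """Total hold time for the stated program (94/60/72 degree steps), no ramps."""
    return cycles * sum(step_seconds) / 60


primer_1285 = "AgC AAA gAC ATg Tgg Cgg AA"  # Sequence ID No. 4, as listed above
print(len(clean(primer_1285)), wallace_tm(primer_1285))
print(cycling_minutes())
```

For primer 1285 (20 nucleotides, 10 A/T and 10 G/C) the estimate is 60, in line with the stated target; the stated program amounts to 105 minutes of hold time.
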
Our genetic and mapping analysis of SMA has led to the identification of the 154 kb insert
of PAC125D9 as the likely site of the SMA gene. We report here the complete DNA sequence of
the 131 kb portion of the PAC125D9 insert which contains both NAIP and SMNtel as well as the 3'
end of a copy of the Basic Transcription Factor gene BTF2p44. PAC125D9 insert digested with a
variety of restriction enzymes was used to generate nine libraries. Shotgun sequencing of clones
from the Sau3AI library was hampered by the Alu-rich nature of the area; sequencing was therefore
conducted by a modified transposon-based approach (ref. 10), yielding the configuration depicted in the
figure. The NAIP and SMNtel genes, separated by 15.5 kb, are in a tail-to-tail (5'->3':3'<-5')
orientation, spanning 56 kb and 28 kb of genomic DNA, respectively. The gene BTF2p44 exists in
a number of copies on 5q13.1 (ref. 10); exons 11-16 of one BTF2p44 copy occupy the most 5' eleven kb of
the PAC insert, followed by an 11 kb interval before NAIP exon 2. The first NAIP exon as
originally reported (ref. 3) is not present in this PAC and may have been a heteronuclear artifact. An
approximately 3 kb section of the 15.5 kb interval between NAIP and SMN (CGA T figure) is
transcribed but contains no protein coding sequence. Indeed, no coding sequence in addition to
BTF2p44, NAIP and SMN was identified throughout the entire interval.
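
As a quick arithmetic check on the figures in this paragraph, the sketch below sums the stated gene spans and the intergenic gap. The variable names are ours; the numbers are those given in the text.

```python
# Sizes in kb, as stated in the paragraph above.
NAIP_KB = 56.0
INTERGENIC_KB = 15.5
SMN_KB = 28.0
SEQUENCED_KB = 131.0

# Combined extent of NAIP, the intergenic interval and SMN.
combined_kb = NAIP_KB + INTERGENIC_KB + SMN_KB
fraction_of_sequence = combined_kb / SEQUENCED_KB

print(combined_kb)
print(round(fraction_of_sequence, 2))
```

The combined extent of 99.5 kb is consistent with the approximately 100 kb region to which this section maps the intact NAIP locus and SMN.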

CpG islands were identified in the 5' region of both the SMN and NAIP genes. One hundred
and forty-five Alu sequences were identified in the 131 kb sequence, with five clusters of high
density seen (figure legend). Such Alu density associated with L1 paucity (five copies) is in
keeping with previous findings for light Giemsa staining (or reverse) chromosomal bands (ref. 11). Copies
of other repeats (e.g. MIR2, MST and MER) as detected by the Sequin program are also as depicted (ref. 12).
The polymorphic microsatellite loci previously mapped to the SMA region (CMS1, CATT (ref. 14) or
C161, C171, C272 (ref. 15) or AG-1 (refs. 16, 17)) as well as unusual single and di-nucleotide repeats are as
shown.

The full length NAIP cDNA (6228 bp with an ORF of 4212 bp) was also elucidated by
cDNA sequencing and comparison with PAC sequence, comprising 17 exons encoding a predicted
156 kDa protein of 1403 amino acids (data not shown). A novel NAIP exon (14a) between the original
exons 14 and 15 was identified. The original exon 17 has been replaced by a novel exon which
contains the stop codon, a 1.6 kb 3' UTR region and the polyadenylation consensus site (AATAAA)
identified by 3' RACE. No new protein domains are found in the NAIP gene.
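
The stated ORF length, residue count and predicted mass can be cross-checked with simple arithmetic. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from the specification; the actual predicted mass depends on the amino acid composition.

```python
CDNA_BP = 6228        # full-length cDNA, as stated
ORF_BP = 4212         # open reading frame, as stated
AVG_RESIDUE_DA = 110  # assumed rule-of-thumb average residue mass

codons = ORF_BP // 3                 # includes the stop codon
residues = codons - 1                # translated amino acids
approx_kda = residues * AVG_RESIDUE_DA / 1000
untranslated_bp = CDNA_BP - ORF_BP   # 5' plus 3' UTR combined

print(codons, residues)
print(round(approx_kda, 1))
print(untranslated_bp)
```

The ORF divides evenly into 1404 codons, giving 1403 residues once the stop codon is excluded, and the rough mass estimate of about 154 kDa is close to the predicted 156 kDa. The remaining 2016 bp of untranslated sequence is compatible with a 3' UTR of roughly 1.6 kb.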

A rigorous definition of how far deletions extend on type I SMA chromosomes is central to
our understanding of disease pathogenesis. If the genotype most frequently observed on type I
SMA chromosomes (i.e. absence of NAIP exons 4 and 5 as well as SMNtel exons 7 and 8) is the
result of a single event, then our sequencing suggests a minimal deletion size of 60 kb. The high
deletion frequency on type I SMA chromosomes of CATT-40G1 (which maps between NAIP
exons 7 and 8) is consistent with such a deletion.

Southern blots containing genomic DNA probed with NAIP cDNA reveal a diversity of
bands, a result of the polymorphic number of variant forms of this locus mapping to 5q13.1. In
contrast, the same blots probed with SMN cDNA reveal only the bands associated with the intact
SMN locus, for SMA and non-SMA individuals alike. Thus, there is no evidence of truncated or
partially deleted SMN genes such as seen with the NAIP gene. The absence of any detectable SMN
junction fragment in SMA patients strongly suggests that the SMNtel exon 7 and 8 deletion detected
in the significant majority of SMA cases incorporates the entire SMNtel gene, thus extending the
putative minimal SMA type I deletion to approximately 100 kb (figure). This is in keeping with the
high deletion frequency of the C272 (ref. 15) or AG-1 (refs. 16, 17) microsatellite (which maps to SMN exon 1, figure)
on type I SMA chromosomes. A 15% deletion frequency of one copy of BTF2p44 is observed in
all SMA cases irrespective of clinical severity (ref. 9), suggesting that this mutation may not be an
extension of the putative SMN-NAIP deletion. Clarification of this issue must await details of
which copy of p44 is deleted.

Our sequencing of PAC125D9 maps the intact NAIP locus and clinically relevant SMNtel to
a 100 kb region which contains those microsatellite polymorphisms that are preferentially deleted
on the significant majority of type I SMA chromosomes (i.e. CATT-40G1 (ref. 14), C272 (ref. 15) or AG-1 (refs. 16, 17)).
The absence of any protein coding sequence, other than NAIP and SMN, in this interval focuses
attention on these two genes as the key modulators of type I SMA. One potential pathogenic model
is that SMNtel absence acts as the primary neurotoxic insult, with NAIP depletion/absence leading
to an attenuated apoptotic resistance (refs. 5, 6), exacerbating motor neuron attrition. Presence of additional
SMNcen may also act to modulate the course of the disease (ref. 20). In addition to aiding in our
comprehension of the molecular pathology of acute SMA, the sequence presented here should help
in the study of transcriptional control elements for both genes, possibly facilitating the formulation
of genetic therapies for this devastating neuromuscular disease.

DNA Sequencing 

Partial Sau3AI (selected for 3-5 kb), BamHI, EcoRI, HindIII, PstI, SstI, XbaI and EcoRV
libraries were made from the PAC125D9 insert and sequenced using a transposon-based
methodology (TN1000, Gold Biotechnology; ref. 10). Subcloning of a large number of inserts into the
commercially supplied pMOB plasmid was found to be problematic; therefore pUC18 and
pBluescript SK were used. In general, fewer than 10% of clones had transposons in the vector
region. E. coli lysate was employed as sequencing template using our modified heat soaked
protocol. Sequencing was from the TN1000 transposon randomly inserted into the target DNA,
using primers of opposite orientation (5'-ATA TAA AGA ACG AAT TAT CTG C-3'; 5'-GTA TTA
TAA TCA ATA AGT TAT ACC-3'), generating approximately 1 kb of sequence with a 5 bp
overlap, easily spanning 300 bp Alu repeats. Our approach permitted sequencing of inserts as large
as 14 kb.

As the SMA region is known to be unstable, special care was taken to ensure an intact, unaltered PAC
insert, primarily by comparison of PAC125D9 insert and genomic DNA
hybridization patterns on Southern blots.

Raw DNA sequence data generated by our automated sequencers (ABI 373 and ABI 373A)
were processed and assembled in parallel by the Sequencher 3.0 program (Gene Codes Inc.) and
the GAP4 program from the Staden package. The edited results were automatically converted into
GCG file formats and placed in a separate database for searches by outside users using our e-mail
server at smafasta@mgcheo.med.uottawa.ca. GRAIL and BLAST searches were employed to
screen for protein coding sequence and the PROSITE protein database was used to search for
protein domains.
Example 5 NAIP Expression Vectors

Using the identified NAIP sequence information, a full-length 3.7 kb NAIP construct tagged
with the myc epitope, MTG-SP3.7, was generated by ligating (i) a 2.5 kb Bsu36I/SalI fragment of NAIP
cloned into Bluescript and (ii) Bsu36I/XhoI-cut MTG-SE1.7, the expression vector pcDNA3 containing
a 300 bp myc epitope and a 1.7 kb fragment of NAIP. HeLa, CHO and Rat-1 cells were transfected
by lipofection (Gibco BRL) with 8 µg of DNA, and G418-resistant transformants were selected by
maintaining the cells in 250 µg/ml, 400 µg/ml and 800 µg/ml G418, respectively.

In a second approach, cells were infected with adenovirus alone or adenovirus expressing
either NAIP, antisense NAIP, or LacZ. For construction of the adenovirus, a 3.7 kb BamHI
fragment of NAIP was cloned into the SwaI site of the adenovirus expression cosmid pAdex1CAwt.
The antisense NAIP RNA contains a sequence complementary to the region of an mRNA
containing an initiator codon. Expression of NAIP was confirmed in both procedures by Western
blot analysis and immunofluorescence. Following infection with the recombinant adenoviruses,
CHO cells were induced to undergo apoptosis by serum deprivation, with survival rates of 48% (no
insert), 51% (LacZ) and 45% (antisense NAIP) at 48 hours (Fig. 1a). In contrast, CHO cells
infected with adenovirus expressing NAIP demonstrated 78-83% survival. NAIP also induced
survival in stably transfected CHO pools, albeit slightly less than that seen in adenovirus-infected
cells: 44% of the vector transfectants and 65% of the NAIP transfectants survived at 48 hours (Fig.
1b). Next, overexpression of NAIP in CHO cells treated with 20 mM menadione (a potent inducer
of free radicals) resulted in 20-30% enhancement of survival compared with controls after 24 hours
(Figs. 1c, 1d). Overexpression of NAIP also protected menadione-treated Rat-1 fibroblasts from
undergoing cell death (Figs. 1e, 1f, 1g, 1h). Only 15% of cells infected with LacZ-expressing
adenovirus were viable at 12 hours, in contrast to 80% of NAIP-infected cells, an effect also detected
with the pooled Rat-1 NAIP transfectants. Even greater survival was induced by NAIP
overexpression at a lower menadione concentration (5 µM), with 98% of pooled NAIP transfectants
and 33% of control transfectants viable at 24 hours (Figs. 1g, 1h). Also assessed was the protective
effect of NAIP on cells exposed to the cytokine TNF-α. HeLa cells treated with TNF-α and
cycloheximide were protected from apoptosis when infected with adenovirus expressing high levels
of NAIP (139%) at 48 hours, an effect not observed with antisense NAIP (52%) (Figs. 1i, 1j). A
similar effect was observed in pooled HeLa transformants.

To confirm that cells surviving the apoptotic agents expressed NAIP, immunofluorescence
with anti-NAIP antisera was performed on a number of the cell death assays. Immunofluorescence
is a technique which localizes proteins within a cell by light microscopy by the use of antibodies
specific for a desired protein and a fluorescence microscope. Dyes can be chemically coupled to
antibodies directed against purified antibodies specific for a desired protein. This fluorescent
dye-antibody complex, when added to permeabilized cells or tissue sections, binds to the desired
antigen-antibody complex, which lights up when illuminated by the exciting wavelength. Fluorescent
antibodies may also be microinjected into cultured cells for visualization. Using immunofluorescence, CY-3, a
dye which emits red light, was coupled to a secondary antibody used to detect the bound anti-NAIP
antibodies. A dramatic enrichment of NAIP expressing cells was observed, with no alteration noted
in the cytoplasmic distribution of NAIP. These data offer strong support for the apoptotic
suppression activity of NAIP.

Example 6 Cellular Distribution of NAIP using NAIP Antibodies

It was previously demonstrated (Roy, N. et al. The gene for NAIP, a novel protein with
homology to baculoviral inhibitor of apoptosis, is partially deleted in individuals with spinal muscle
atrophy. Cell 80: 167-178 (1995)) by reverse transcriptase PCR analysis that the NAIP transcript is
present in human spinal cord. To define more precisely the cellular distribution of NAIP, a 
polyclonal antiserum was raised against NAIP. The NAIP antibodies were then used in both 
immunocytochemistry and immunofluorescence techniques to visualize the protein directly in cells 
and tissues in order to establish the subcellular location and tissue specificity of the protein. 

The ability of the polyclonal antibody to detect NAIP was confirmed by
immunofluorescence of cells transfected with myc-tagged NAIP, employing both the anti-NAIP and
anti-Myc antibodies, as well as western blot analysis on protein extracts of these cells (Fig. 1). In
the western blotting technique, proteins are run on a polyacrylamide gel and then transferred onto
nitrocellulose membranes. These membranes are then incubated in the presence of the antibody
(primary), then following washing are incubated with a secondary antibody which is used for detection
of the protein-primary antibody complex. Following repeated washing, the entire complex is
visualized using colorimetric or chemiluminescent methods. A protein of the expected molecular
weight was detected by both antibodies in western blots and their cellular co-localization
demonstrated by immunofluorescence. Sections of human spinal cord stained with anti-NAIP
showed strong immunoreactivity in the cytoplasm of the anterior horn cells and intermediolateral
neurons (Figs. 3a and 3b). Consistent with the motor neuron staining, NAIP reactivity was
observed in the ventral roots, which contain motor axons, but not the dorsal roots, comprised of
sensory axons (Figs. 3c and 3d). The observation of motor neuron staining correlates well with a
role for the protein in the pathogenesis of SMA. However, the presence of NAIP in
intermediolateral neurons, which are not reported to be affected in SMA, implies heterogeneity in
the apoptotic pathways between the two classes of neurons.

Other Embodiments

In other embodiments, the invention includes any protein which is substantially identical to the
mammalian NAIP polypeptides provided in Figs. 6 and 7 (Seq. ID NOS: 22 and 24); such homologs
include other substantially pure naturally-occurring mammalian NAIP proteins as well as allelic
variants; natural mutants; induced mutants; DNA sequences which encode proteins and also
hybridize to the NAIP DNA sequences of Figs. 6 and 7 (Seq. ID NOS: 21 and 23) under high
stringency conditions or, less preferably, under low stringency conditions (e.g., washing at 2X SSC
at 40°C with a probe length of at least 40 nucleotides); and proteins specifically bound by antisera
directed to a NAIP polypeptide. The term also includes chimeric polypeptides that include a NAIP
portion. The sequence of Seq. ID No. 1 and the IAP proteins are specifically excluded.

The invention further includes analogs of any naturally-occurring NAIP polypeptide.
Analogs can differ from the naturally-occurring NAIP protein by amino acid sequence differences,
by post-translational modifications, or by both. Analogs of the invention will generally exhibit at
least 85%, more preferably 90%, and most preferably 95% or even 99% identity with all or part of a
naturally occurring NAIP amino acid sequence. The length of sequence comparison is at least 15
amino acid residues, preferably at least 25 amino acid residues, and more preferably more than 35
amino acid residues. Modifications include in vivo and in vitro chemical derivatization of
polypeptides, e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications
may occur during polypeptide synthesis or processing or following treatment with isolated
modifying enzymes. Analogs can also differ from the naturally-occurring NAIP polypeptide by
alterations in primary sequence. These include genetic variants, both natural and induced (for
example, resulting from random mutagenesis by irradiation or exposure to ethanemethylsulfate or
by site-specific mutagenesis as described in Sambrook, Fritsch and Maniatis, Molecular Cloning: A
Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et al., supra). Also included are cyclized
peptides, molecules, and analogs which contain residues other than L-amino acids, e.g., D-amino
acids or nonnaturally occurring or synthetic amino acids, e.g., β or γ amino acids. In addition to
full-length polypeptides, the invention also includes NAIP polypeptide fragments. As used herein,
the term "fragment" means at least 20 contiguous amino acids, preferably at least 30 contiguous
amino acids, more preferably at least 50 contiguous amino acids, and most preferably at least 60 to
80 or more contiguous amino acids. Fragments of NAIP polypeptides can be generated by methods
known to those skilled in the art or may result from normal protein processing (e.g., removal of
amino acids from the nascent polypeptide that are not required for biological activity or removal of
amino acids by alternative mRNA splicing or alternative protein processing events).
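
The identity thresholds and minimum comparison lengths described above can be illustrated with a short sketch. It assumes the two sequences have already been aligned to equal length (gaps shown as "-") and counts identical positions; in practice identity would be determined with sequence alignment software. The function names and test peptides are hypothetical.

```python
def percent_identity(a, b):
    """Percent of aligned positions carrying the same residue (gaps never match)."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("empty comparison window")
    matches = sum(x == y and x != "-" for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def meets_threshold(a, b, min_identity=85.0, min_length=15):
    """Apply an identity cutoff over a window of at least min_length residues."""
    return len(a) >= min_length and percent_identity(a, b) >= min_identity


reference = "ACDEFGHIKLMNPQRSTVWY"  # hypothetical 20-residue window
variant = "ACDEFGHIKLMNPQRSTVWA"    # one substitution at the last position
print(percent_identity(reference, variant))
print(meets_threshold(reference, variant))
```

The defaults of 85% and 15 residues correspond to the least stringent values stated in the paragraph; the stricter preferred values (for example 95% over more than 35 residues) can be passed as arguments.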

Preferable fragments or analogs according to the invention are those which facilitate specific
detection of a NAIP nucleic acid or amino acid sequence in a sample to be diagnosed. Particularly
useful NAIP fragments for this purpose include, without limitation, the amino acid fragments shown
in Table 2.
What is claimed is:

1. A method of inhibiting apoptosis in a cell, said method comprising administering to said
cell an apoptosis inhibiting amount of NAIP polypeptide.

2. A method of inhibiting apoptosis in a mammal, said method comprising providing a
transgene encoding a NAIP polypeptide or fragment thereof to a cell of said mammal, said
transgene being positioned for expression in said cell.

3. A method of inhibiting apoptosis in a cell, said method comprising administering a
compound which increases NAIP biological activity.

4. The method of claim 2 or 3, wherein said cell is in a mammal.

5. The method of claim 4, wherein said mammal is a human.

6. The method of claim 1 or 2, wherein said cell is in a mammal diagnosed as being HIV-
positive, or as having AIDS, a neurodegenerative disease, a myelodysplasia syndrome, or an
ischemic injury.

7. The method of claim 6, wherein said ischemic injury is caused by a myocardial
infarction, a stroke, a reperfusion injury, or a toxin-induced liver disease, physical injury, renal
failure, a secondary exsanguination or blood flow interruption resulting from any other primary
diseases.

8. The method of claim 1, 2, or 3, wherein said cell is a muscle cell.

9. The method of claim 1 or 2, wherein said muscle cell is a myocardial cell.

10. The method of claim 1 or 2, wherein said muscle cell is a renal cell.

11. The method of claim 1 or 2, wherein said muscle cell is a neuron.

12. The method of claim 2 wherein said transgene encodes NAIP.

13. The method of claim 6, wherein said mammal is HIV-positive or has AIDS. 



WO 97/76331    PCT/IB97/00142



14. The method of claim 13, wherein said cell is a T cell. 

15. The method of claim 14, wherein said T cell is a CD4* T cell. 

16. The method of claim 6, wherein said mammal has a neurodegenerative disease. 

17. The method of claim 6, wherein said mammal has an ischemic injury.

18. A method for increasing apoptosis in a cell, said method comprising administering a
compound which decreases NAIP anti-apoptotic activity.

19. The method of claim 18, wherein said compound is NAIP antisense RNA.

20. The method of claim 18, wherein said compound is an antibody which specifically 
binds NAIP. 

21. A substantially pure nucleic acid encoding a NAIP polypeptide.

22. The nucleic acid of claim 21, wherein said nucleic acid is mammalian. 

23. The nucleic acid of claim 22, wherein said mammal is a human. 

24. The nucleic acid of claim 21, wherein said nucleic acid is genomic DNA or cDNA. 

25. A substantially pure DNA having the sequence of Fig. 6, or degenerate variants thereof,
and encoding the amino acid sequence of Fig. 6.

26. Substantially pure DNA having about 50% or greater nucleotide sequence identity to the
DNA sequence of Fig. 6.

27. The DNA of claim 26, wherein said nucleotide sequence identity is 75% or greater.

28. A purified DNA sequence substantially identical to the DNA sequence shown in Fig. 6.

29. The DNA of claim 21, wherein said DNA is operably linked to regulatory sequences for
expression of said polypeptide and wherein said regulatory sequences comprise a promoter.

30. The DNA of claim 29, wherein said promoter is a constitutive promoter, is inducible by
one or more external agents, or is cell-type specific. 

31. The nucleic acid of claim 21, wherein said nucleic acid comprises a deletion of the 
nucleic acids encoding the carboxy terminal amino acids of NAIP. 

32. A vector comprising the nucleic acid of claim 21, said vector being capable of directing
expression of the peptide encoded by said nucleic acid in a vector-containing cell. 

33. A cell that contains the DNA of claim 21. 

34. The cell of claim 33, said cell being present in a patient having a disease that is caused 
by excessive or insufficient cell death. 

35. The cell of claim 33, said cell being selected from the group consisting of a fibroblast, a
neuron, a glial cell, an insect cell, an embryonic stem cell, a myocardial cell, and a lymphocyte.

36. A transgenic cell that contains the DNA of claim 21, wherein said DNA is expressed in 
said transgenic cell. 

37. A transgenic animal generated from the cell of claim 33, wherein said DNA is expressed
in said transgenic animal.

38. A substantially pure mammalian NAIP polypeptide, or fragment thereof.

39. The fragment of claim 38, wherein said fragment comprises the three BIR domains of 
NAIP and lacks at least a portion of the carboxy terminus of NAIP. 

40. The polypeptide of claim 38, said polypeptide being encoded by the nucleic acid of
claim 17.

41. The polypeptide of claim 38, said polypeptide comprising an amino acid sequence
substantially identical to an amino acid sequence shown in Figs. 6 or 7.

42. The polypeptide of claim 38, wherein said polypeptide is a mammalian polypeptide.

43. The polypeptide of claim 38, wherein said polypeptide is a human polypeptide.

44. A therapeutic composition comprising as an active ingredient a NAIP polypeptide
according to claim 38, said active ingredient being formulated in a physiologically acceptable
carrier.

45. The composition of claim 44, said active ingredient being a NAIP polypeptide encoded
by the nucleic acid of claim 17.

46. A method of detecting a NAIP gene in an animal cell, said method comprising
contacting the nucleic acid of claim 17, or a portion thereof that is greater than about 18 nucleotides
in length, with a preparation of genomic DNA from said animal cell, said method providing
detection of DNA sequences having about 50% or greater nucleotide sequence identity with the
sequence of Fig. 6.

47. The method of claim 46, wherein said detecting is to diagnose a condition involving 
altered levels of apoptosis. 

48. The method of claim 47, wherein said condition is Amyotrophic Lateral Sclerosis.

49. A method of obtaining a NAIP polypeptide, said method comprising:

(a) providing a cell with DNA encoding a NAIP polypeptide, said DNA being positioned for
expression in said cell;

(b) culturing said cell under conditions for expressing said DNA; and

(c) isolating said NAIP polypeptide.

50. The method of claim 49, wherein said DNA further comprises a promoter inducible by
one or more external agents.

51. A method of isolating a NAIP gene or portion thereof having sequence identity to
human NAIP, said method comprising amplifying by PCR said NAIP gene or portion thereof using
oligonucleotide primers wherein said primers

(a) are each greater than 13 nucleotides in length;

(b) each have regions of complementarity to opposite DNA strands in a region of the
nucleotide sequence of either Fig. 6; and

(c) optionally contain sequences capable of producing restriction enzyme cut sites in the
amplified product; and isolating said NAIP gene or portion thereof.

52. A method of isolating a NAIP gene or fragment thereof from a cell, said method
comprising:

(a) providing a sample of cellular DNA;

(b) providing a pair of oligonucleotides having sequence homology to a conserved region of
a NAIP gene;

(c) combining said pair of oligonucleotides with said cellular DNA sample under conditions
suitable for polymerase chain reaction-mediated DNA amplification; and

(d) isolating said amplified NAIP gene or fragment thereof.

53. The method of claim 52, wherein said amplification is carried out using a
reverse-transcription polymerase chain reaction.

54. The method of claim 53, wherein said reverse-transcription polymerase chain reaction is
RACE.

55. A method of identifying a NAIP gene in a mammalian cell, said method comprising:

(a) providing a preparation of mammalian cellular DNA;

(b) providing a detectably-labelled DNA sequence having homology to a conserved region
of a NAIP gene;

(c) contacting said preparation of cellular DNA with said detectably-labelled DNA sequence
under hybridization conditions that provide detection of genes having 50% or greater nucleotide
sequence identity; and

56. The method of claim 51, 52, or 55 wherein said DNA sequence comprises at least a
portion of exon 14a or exon 17 of NAIP.

57. A NAIP gene isolated according to a method comprising: 

(a) providing a sample of cellular DNA; 

(b) providing DNA sequence, said sequence comprising a pair of oligonucleotides having 
sequence homology to a conserved region of a NAIP gene absent in Seq. ID No. 1;

(c) combining said pair of oligonucleotides with said cellular DNA sample under conditions
suitable for polymerase chain reaction-mediated DNA amplification; and

(d) isolating said amplified NAIP gene or fragment thereof.

58. A NAIP gene isolated according to the method comprising:

(a) providing a preparation of cellular DNA;

(b) providing a detectably-labelled DNA sequence having homology to a conserved region
of a NAIP gene absent in Seq. ID No. 1;

(c) contacting said preparation of cellular DNA with said detectably-labelled DNA sequence
under hybridization conditions providing detection of genes having 50% or greater nucleotide
sequence identity; and

(d) identifying a NAIP gene by its association with said detectable label.

59. A method of identifying a NAIP gene, said method comprising:

(a) providing a mammalian cell sample;

(b) introducing by transformation into said cell sample a candidate NAIP gene;

(c) expressing said candidate NAIP gene within said cell sample; and

(d) determining whether said sample exhibits an altered level of apoptosis whereby an
alteration in the level of apoptosis identifies a NAIP gene.

60. The method of claim 59, wherein said cell sample is selected from the group consisting 
of a lymphocyte, a fibroblast, an insect cell, a glial cell, a myocardial cell, an embryonic stem cell, 
and a neuron. 

61. A purified antibody that binds specifically to a NAIP polypeptide.

62. A method of identifying a compound that modulates apoptosis, said method comprising:

(a) providing a cell expressing a NAIP polypeptide; and

(b) contacting said cell with a candidate compound and monitoring the expression of a
NAIP gene, an alteration in the level of expression of said gene indicating the presence of a
compound which modulates apoptosis.

63. The method of claim 62, wherein said NAIP gene is human NAIP. 

64. The method of claim 63, wherein said cell is a myocardial cell expression. 

65. A method of diagnosing a mammal for the presence of disease involving altered
apoptosis or an increased likelihood of developing a disease involving altered apoptosis, said
method comprising isolating a sample of nucleic acid from said mammal and determining whether
said nucleic acid comprises a NAIP mutation, said mutation being an indication that said mammal
has an apoptosis disease or an increased likelihood of developing a disease involving apoptosis.


66. A method of diagnosing a mammal for the presence of a disease involving altered
apoptosis or an increased likelihood of developing a disease involving altered apoptosis, said
method comprising measuring NAIP gene expression in a sample from said mammal, an alteration
in said expression relative to a sample from an unaffected mammal being an indication that said
mammal has an apoptosis disease or increased likelihood of developing an apoptosis disease.

67. The method of claim 65, wherein said NAIP gene is human NAIP.

68. The method of claim 65, wherein said gene expression is measured by assaying the 
amount of NAIP polypeptide in said sample. 

69. The method of claim 66, wherein said NAIP polypeptide is measured by immunological
methods or by assaying the amount of NAIP RNA in said sample.

70. A kit for diagnosing a mammal for the presence of a disease involving altered apoptosis 
or an increased likelihood of developing a disease involving altered apoptosis, said kit comprising a
substantially pure antibody that specifically binds a NAIP polypeptide. 

71. The kit of claim 70, further comprising a means for detecting said binding of said
antibody to said NAIP polypeptides.

72. A method of inducing apoptosis in a cell, said method comprising administering to said
cell a negative regulator of the NAIP-dependent anti-apoptotic pathway. 

73. The method of claim 72, wherein said negative regulator is a purified antibody or a 
fragment thereof that binds specifically to a NAIP polypeptide. 

74. The method of claim 73, wherein said negative regulator is a NAIP antisense mRNA
molecule.

75. A NAIP nucleic acid for use in modulating apoptosis.
76. A NAIP polypeptide for use in modulating apoptosis. 

77. The use of a NAIP polypeptide for the manufacture of a medicament for the modulation
of apoptosis. 

78. The use of a NAIP nucleic acid for the manufacture of a medicament for the modulation 
of apoptosis.

79. A method of treating SMA in a patient, said method comprising administering a
polypeptide having at least two BIR domains of an anti-apoptotic protein.

80. A method of treating SMA in a patient, said method comprising administering a nucleic 
acid encoding a polypeptide having at least two BIR domains of an anti-apoptotic protein.

81. The method of claim 79 or 80, wherein said polypeptide has at least three BIR domains.

>HSU19251, 5502 bases, 79F5B1F2 checksum. 5502 nt vs.

>naip-seq, 6133 bases, FD809DB checksum. 6133 nt

77.8% identity; Optimized score: 13374 

10 20 30 40 50 60 

naip-O TTCCGGCTGGACCmX^CCTGTGTACCTCTTC 

naip.s T- -- ' — ' " 

70 80 90 100. 110 120 

naip-o GGGTATTGACCCCAGACAXCAATGCCXCTTCATATTGCATGAAGXCAAAAGGTCCTGTGC 

::::ttt:::: 

-_ iT , fl - — GCATGAAGXCAAAAGGTCCTGTGG 

naiP '* 10 20 

130 UO 150 160 170 180 

naip-O TCACCTGGGACCCTTCTGGACtnTGCCCTGTGTTC 

::::::::: j :t: :::: J : J s :::::::: ::?::::«»:::::::: 
naip.s TCACCl^aGGACCCTTCTGGACGTTGCCCTGTGTACCTCTTCGACTGCCTGTTCATCTACO 
30 40 50 60 70 B0 



190 200 210 220 230 240 

naip-o j^c^CCCCGGCTATTGACCCCA^ 

::::i::::::::s:Mn»:!:s::::!:::i:!-J!!!»" :!!S:!:!:,!!,!! 
naip.s ACGAACCCCGGGTATI^CCCCAGACAACAATGCCACTTCATATTGGGGACrrcGTCTGG 
v iaa nn tin 130 140 



250 260 270 280 290 300 

naip-O GATTCCAAGGTTCJITTCATTGCAAAG^^ 

::::::: j :::::::::::::::: t ::: : s s s i s t :« s s s 8 : s 1 * : Ui*!jl 111 ? * 
naip . B GATTCCAACXM'GCATTCATTGCAAAGTTCCTTAAATAT'n^rCT 

150 160 170 180 190 200 

310 320 330 340 350 360 

naip-o O^CGGACAGAt^TTTGTTC^^ 

.... ..... . • : • , t ::::::::::::: it a 1 1 mi J x an t : : s : : : : : J * » » » 1 1 : 

naip . 8 ^CG^CA(»GCATTTGTTC!maW3CCACATACTT^^ 

210 220 230 240 250 260 

370 - 380 390 400 410 420 

naip-O TCTATTAGACTAGAACTGTGGATAAACCXOkGAAAATGGCCACCCAGCAGAAAOCCTCTG 
: j : i : s : : t : j i : s : 1 1 : s : s : : : : : : s : : t » * 1 1 : s : s s : : » : « s * t » s : : : t : : : J » : 
naip . S TCTATTAGACTAGAACTGTGGATAAACC TCAGAAAATGGC CACCCAOCAGAAAGCCTCT6 
270 280 290 300 310 320 

430 440 450 460 470 480 

naip-o ACGAGAGGATCTCC CAGTTIX^TCACAATTT<XrrcCCAGAXX;TGTCTGC TCTTCTGGQCC 
r t s : s j : : : : i s : : : : : s : : : : : : : : s : s : s s : s * s s « • * * ' xsssssstsstssssss: 
naip . S ACGA1&<^TCTCCCAGTTTGATCACA^^ 

330 340 350 360 370 380 
490 500 510 520 530 540

naip-o TAGATTCAGTTCAGTTGGCJU^^ 

I::::::::::::::::::::;::::: : :::::::::: : : : : : : ; :::::::::::::: : 
TAGATGCAGTTCAGTTGGCAAAGGAACTAGAAGAAGA 

390 400 410 420 430 440 



naip.s 



550 560 570 580 590 600 

naip-o TGCAGAAAGCXTACAACTCTCA^ 

::::::::::::::::::: t : : ::::::: J : : : : : : ::::::::::::: : : : 
naip* 6 TCKl&GAAAGGCTACAACTCTCAAATGCGC^ 

450 460 470 480 490 500 

610 620 630 640 650 660 

naip-o CTTATGAGCCGTACAGCTCATGGATACCACAGGAGATGGC 

: : : :::::::: s s 1 1 1 : t : i : : : : : s : : i : : 1 1 :::::::: : : s : : : 
naip.B C TTATGAGC CGTACAGCTCATGGATACCACAGGAGATGGCGGC C GCTGGGTTTT ACTTCA 
510 520 530 540 550 560 

670 6B0 690 700 710 720 

naip-o CTGGGGTAAAATCTGGGATTCAGTGCTTCTGCTC 

st : : : : : : : : ::::::::::: : : i I : : : t m t St it i u : s : : : s : : ; : t : : : : 
naip.fi CTGGGGTAAAATCTGGGATTCAGTGC TOCTGCTGTAGCCTAATCCTCTTTGGTGCCGGCC 
570 580 590 600 610 620 

730 740 750 760 770 780 

naip-o TCACGAGACTCCX:CATAGAAGACCJLCAAGAGGTTTCATC 

:s ::::::::::: :::s:::: 8 8 8 : t :: 
naip.S TOICGAGACTCCCCATAGAAGACCACAAGAGGTTTCATCCAGATO 

630 640 650 660 670 680 

790 800 810 820 830 640 

naip-o ACAAGGATG1TGGT AACATTGCCAAGTAC GACATAAGGCTGAAGAATCTGAAGAGCAGGC 
t : : : : : : : : : is : s i » 1 1 s : :ss i: » s: : : : : J t t i t : : s s : « t s s : :s : s s : : : 
naip.S ACAAGGATGOTGGTAACATTGCCAAGTACGACATAAGGGT^ 

690 700 710 720 730 740 

850 860 870 880 890 900 

naip-o TGAGAG<^GCTAAAATGAGGTACCAAGA^ 

:: j :::::::: j j 2 t t : s 8 : : 1 1 : - x : 1 1 
naip ■ s TGAGAGGAGCTAAAATGAGGTACCAAGAAGAGGAG^ 

750 760 770 780 790 800 

9X0 920 930 940 950 960 

naip-O GGCCATTTTAWHCCAA 

tstx88;:i'tti»si8si8S&'8:-s::it:tiiis:ss«istst:iS'iti 8 ; : 1 1 : : : : : s 
naip.S GGCCATTTTA ^ ^ ^ ^ ^ 

970 980 990 1000 1010 1020 

naip-o CAGGTAAACAGGACACGGTACAGTGTTI^^ 

2i2223*2222*2!°S3«*2»2«»" 2 • 8 2 2 8 8 8 8 8 8 8 8 8 8 8 • » 8 8 8 8 • 8 8 ■ • 

naip . s CAGGTAAACAGXSACACGGTACAGTGTTTTTCCTGTGG 

870 880 890 900 910 920 
1030 1040 1050 1060 1070 1080

naip-o AAGGAGATGATC C TTCG^GGAACATCCCAAATGGTTCCC CAAATGTGAATTTCTTC QGA 
::::::::: s ::::::: r::i: : ::::s:: s : i ::: : 
jiaip,s MGGAGATGATCCTTGGAAC^ 

930 940 950 960 970 980 

1090 1100 1110 1120 1130 1140 

naip-o GTAAGAAATCCTCAC^GGAAATT^ 

:::::::::::: s :::::::: i : : : : : : : : : s : nUi : : :::::: : : : : : : : : s : s : : i 
naip.s GTAAGAAATCCTCA(^C&AAA™ 

990 1000 1010 1020 1030 1040 

1150 1160 1170 1180 1190 1200 

naip-o TAACGGGAGAACAlTTT^K^aTC 

::;:::::::::!::::::::::::::::::::: • « s * :::: : : : : :: : 

naip.S TAACGGGAGAACATTOTGTGAATTCCTGGG^ 

1050 1060 1070 1080 1090 1100 

1210 1220 1230 1240 1250 1260 

naip-o ATTGCAATGACACXATCTTTGCTTACGAAGAACT 

: ::::::: i :::::::::: 3 1 t ::: : ; : i : : t : : : i ::: : : : : : : : 
naip.fl ATTGCAATGACAGCATCTTTGCTTA^ 

1110 1120 1130 1140 1150 1160 

1270 1280 1290 1300 1310 1320 

naip-o CCCGGGAATCAGCTGTGGGAGTTGCAG^ 

:::::::::::: :mmii t j $:::::» st :::::::::: : ; t it 1 1 1 t i : i : : : i * * 
naip.s CCCGGGAATCAGCTGTGGGACnTCCAGCACT^ 

1170 1180 1190 1200 1210 1220 

1330 1340 1350 1360 1370 1380 

naip-O TAAAGGACATCGTCCAGTGCTTTTCCTGTGGAGGGTGTOT 

....... • : ; • ♦ : : s t s :::::::::: s : t n : * itt t s s t : : : t : : : 

naip.fi TAAAGGACATCGTCCAGTGCTTTTCCTGTGGAGGGTG 

1230 1240 1250 1260 1270 1280 

1390 1400 1410 1420 1430 1440 

naip-O ATGACCCATTAGAC GATCACJ^CAGXTGTTTTCCCAATTGTCCA 

:::: t:::::: :::::::::: ::::tt: u-t t: :: r:::::r:n t: :::::::::: 
naip.6 ATGACCCATTAGACGATCACACCAGATGTlTrc 

1290 1300 1310 1320 1330 1340 



naip-o 



1450 1460 1470 1480 1490 1500 

AGTCCTCTGCGGWIGTGACTCC^GAC^ 

::::::::::::::::: s ts it : : : ; t x : : s : ; ti 
naip . E AGTCCTCTGCGGAACTTIACTCCAGACCTTCAG 

1350 1360 1370 1380 1390 1400 

1510 1520 1530 1540 1550 1560 

naip-o AAACCACAAGTGAAAGCAATCTTCAAGACT 

naip.B AAACCACJU^^ 
>-i 1410 1420 1430 1440 1450 1460 
1570 1580 1590 1600 1610 1620

naip-0 TGGCACAGGGTGMKCCAGTGGT^ 



naip I s TGGCAGAGGGTGAAGCC CAGTGGTTTC^ TGAGAG 
1470 1480 1490 1500 1510 1520 

1630 1640 1650 1660 1670 1680 

naip-o CAGCTTATAC CAGC GCCAGTTTCCGC CACATCTC TTTGCTTGATATCTCTTCCGATCTGG 
::::::::::::::::::::: r : : : : 2 : : : tit :: j ::::: 2 2 2 : 
naip • 8 CAGCTTATACCAGCGCCAGTTTCC GCCACATGTCnTGC TTGATATCTC TTCCGATC TGG 
1530 1540 1550 1560 1570 1580 

1690 1700 1710 1720 1730 1740 

naip-O CCACGGACCACTTGCTGGGCTGTGATCTGTCTATTGCTTCAAAACAC^ 

::::::: 2 x ::::: : : : : : : :t t : : : : : : 2 : 2 : : j 2 2 2 2 2 2 : 2 2 2 : : : 2 : : : : : 
naip . S CCACGGACCACTTGCTGGGCTGTGATCTGTCTATTGCT^^ 

1590 1600 1610 1620 1630 1640 

1750 1760 1770 1780 1790 1800 

naip-o TGGAAGAACCTCTGGTGCTGCCTGAGCTCT^ 

2 ::: 2 : s : : 2 : : 2 : : 2 2 2 J 2 2 2 1 : 2 : 2 2 2 

naip.fi TGCAAGAACCTCTGCTGCTGCCTGAG^ 

1650 1660 1670 1680 1690 1700 

1810 1820 1830 1840 18SQ 1860 

naip-o AGGGTGAAGCTGGAAGTGC3AAAGAC GOT^CTCCTGAAGAAAATAfiCTrTTCTGTGGGCAT 

2:222222222:::::: 222 2 ; 222: 2:2 2 : : 2 ::: : :: 2 2: : ;:: : : 2 : : 2 2 2 : 222 
naip. 8 AGGGTGAAGCTGGAAGTGGJUUU^GGTCCTCCTGAA 

1710 1720 1730 1740 1750 1760 

1870 1880 1890 1900 1910 1920 

naip-O CTCGATGCOOTKXCCTGTOAAACAG 

:: 2 2 22:2222: 22221:2 it 2222:2222:22:2:2:2222:22222:222 
naip . 8 CTGGATGCTGTTCCCCTGTTAAAX^AGGTTCCAGCTGGT^^^ 

1770 1780 1790 1800 1810 1820 

1930 1940 1950 I960 1970 1980 

naip-o CCAGACCAGACGAGGGGCTGGCCAGTATCATCTGTGACCAGCTCCTAGAGA 



naip.B CCAGAC CAGACGAGGGGCTGGCCAGTATCATCTGTGACCAGCTCCT 

1830 1840 1850 I860 1870 1880 

1990 2000 2010 2020 2030 2040 

naip-o CTGTTACTGAAATGTCCATGAGGAACATTATCCAQ 

■2:JSSXSSi!5S*S2S»:S52rS5:Si5S:!l5J58 
naip.S CTGTTACrGAAATGTGCATGAGGAACATTATCCAGCAGT^ 

1890 1900 1910 1920 1930 1940 

2050 2060 2070 2080 2090 2100 

naip-O TTTTAGATGACTACAAAGAAATATGTTCAATCCCTC 

: : : : : t : : : : : : 2 : : : : : t : : : : 2 : : : s : : 2 : : s : : : : : 2 s : s : : s 2 : : : : : : : : : s 2 
naip . 8 TTTTAGATGACTACAAAGAAATATGT1CAA 

1950 1960 1970 1980 1990 2000 
2110 2120 2130 2140 2150 2160

naip-o A^CCACTTATCCCGGACClWCTArr^TTGCTGTCCGTACAAA^GG^CAGGG^ 



naip.e 



AAAACCACWATCCCGGACCTGCCTATTGATTGCTGTCCGTACAAJlCAGGGCCAGGGACA 
2010 2020 2030 2040 2050 2060 



2170 2180 2190 2200 2210 2220 

naip-o TCCGCCGATACCTAGAGACCATTCTA^GATCCAAGCATTTCCCTTTTATAATACTGTCT 
........... j : at n : : : t : : :t t :: : : :: : i :::::::: : 

nait>. 8 1^MC^TACCTAGAGAC^TTCTA(^GATCAAAGCATTTCCGTTTTATAATACTGTCT 
2070 2080 2090 2100 2110 2120 

2230 2240 2250 2260 2270 2280 

naip-O CTATAWACGOAMCTCTmC^ 

naio.S CTATAWACGGAAKTCTTTTCA^ 

2130 2140 2150 2160 2170 2180 



2330 2340 




naip-o 

::: j;r:i:n:>is:rt»j»i::::!s:is:::i:-J !!!SS " !:: * t,:,,:,:s!: 
noip.S TTGGAAAGAACCAAXGTTTGCAGAAGATACAGAAAAC TCCTCTCTTTGTGGCGGCGATCT 

2190 2200 2210 2220 2230 2240 

2350 2360 2370 2380 2390 2400 

naip-o ctgctcattggtttcagtatccttttoacccatccwtgatgatgtOoctgttttcaaot 

::::::!i::iJJ!::ti!::!iitJ!::::: !:Ji:!t:ins : : t : 1 1 \ ; 8 t : $ 

naip.8 GTCCTCArTGGrm^ATCCT^^ 

2250 2260 2270 2280 2290 2300 

2410 2420 2430 2440 2450 2460 

naip-O CCTATATGGAACGCCTTTCCTOAAGGAACAAAGCGACAGCTGAAATT^ 

:! s s 5 ' 5 ! { 1 5 s : 5 : 5 ! 5 2 1 : 5 5 : : : 1 1: 5 
na ip . 8 C CTA TATGGAAC GC CTTTCC TT AAGGAACAAAGCGACAGC TGAAATTCTCAAAGCAACTG 
2310 2320 2330 2340 2350 2360 

2470 2480 2490 2500 2510 2520 

naip-o TGTCC TC CTGTGGTGAGCTGGCC TTGRAAGGO" "l"l"i"i"i'l'CATGTTGCTTTGAGTTTAATG 
:: ttiti :::::::::: it: :ss :s : : : s s : : : : : : : : : i»ni» Mii 8 x viit xxt : 
naip s TGTCCTC CT0TGCTGAGCTG(^CTTGAAAGGCrCTTTTTTCATOn<!CrrTGA0TOTAA.TO 
2370 2380 2390 2400 2410 2420 

2530 2540 2550 2560 2570 2580 

naip-O AT^TGXrcTCGCAGAAGCAGGGGTTGATGAAGATGAAGATCTAACCATGTOCTTGATGA 
:::::::::::::::::: t i a t : : : : : l m 1 1 1 ; s s : i :::::: i.» tt i :::::::: : 
M^'I^TCTCG^GAAG^GGGGTTG^ 

2430 2440 2450 2460 2470 2480 

2590 2600 2610 2 620 2630 2640 

naip-o GCAAATTTACAGCC CAGAGACTAAGAC CATTCTACCGGTTTTTAAGTCCTGCCTTCCAAG 

:::::: ::::s:ss::::5!S5J s,lj::S!t!:!:ssI ■ • • * ♦« •• 

naic . 8 GCAAATTTACAGCCCAGAC^CTAAGACCAITCTACCGGTPTTTAAGTCCTGCCTTCCAAG 
,: ' 2490 2500 2510 2 520 2530 2 540 
2650 2660 2670 2680 2690 2700

naip-o aattik:ttgcggggatgaggctgattgaactcctggatt^ 

naip . s AA TTTC TTGC GGGGATGAGGCTGATTGAACTC C TGGATTCAGATAGGCA GGAACATCAAG 
2550 2560 2570 2580 2590 2600 

2710 2720 2730 2740 2750 2760 

naip-O ATTTGG<^CTGTATCATTTGAAACAAATCAACTCACCCATGRTGACTGTAAGCGCCTACA 

• » 4 • • 4 • « III *• I | t | •* M I M I t I * I I • I * I I t I ( • i t • i • • ( { I 4 « I f i i 

naip . fi ATTTCGC^CTGTATCATTTG^ 

2610 2620 2630 2640 2650 2660 

2770 2780 2790 2800 2810 2820 

nadp-o ACAATTTTTTGAACTATGTCTCCA 

naip , s ACAAITTTTTGAACTATGTCTCCAGCCTCCCT^ 

2670 2680 2690 2700 2710 2720 

2830 2840 2850 2860 2870 2880 

naip-o CTCATTTGCTCCATTOACn^ 

I:::::::::::::::::::::::::;:;:::::?:::::::::::::::::::::;::: 
naip.s CTCATTTGCTCCATTTAGTGGATAACAAAGAGTCATTC^ 

2730 2740 2750 2760 2770 2780 

2890 2900 2910 2920 2930 2940 

naip-o ACTACTTAAAGC^CAGCCAGAAATTTGACT 

::::::::::: :::: i ::::::::::: s : 2 s s : s ::: s s ::::: ; : s : : : : : 
naipifl ACTACTTAAAGC^CAGCCAGAAATTTCACTGCAGA 

2790 2600 2810 2820 2830 2840 

2950 2960 2970 2980 2990 3000 

naip-O AAATTT^CACAAGCTTAC 

s :::::: : t r ::::::: s :::: s : : : ::::::: : : : : : : : s 
naip.s AAATTTGTCCACAAGCTTACTTTTGAATGGTTTCAGA^ 

2850 2660 2870 2880 2890 2900 

3010 3020 3030 3040 3050 3060 

naip-O AAACTGCTTATCAAAGCAACAJCTGTTGCTGGGTGTTC 

• .tiitiitfiiii<!ii*iii«|«ttf»i|«*«*i(iti({!Hi**H! ,, t l !lff(ft 

naip . 8 AAACTGCTTATCAAAGaUUACTGTTGCTGCGTGTTCTCCA^^ 

2910 2920 2930 2940 2950 2960 

3070 3080 3090 3100 3110 3120 

naip-o AAGGGAGAACACTCACTTTGGGTGCGCTT 

a »••••■■«■ ■ ■««•••••. • « ■ a a • ■ ■•• 

• «•••••»••«••■•>*■*«• •••««*•*•*■•■••••••••••«••■■«■••-•««. • a ■ • 

naip.s AAGGGAGAACACTGACTTTGGGTGC GCTTAACOTACAGTACTZTTTCGACCACCCAGAAA 
2970 2980 2990 3000 3010 3020 

3130 3140 3150 3160 3170 3180 

naip-o GCTTCTCATTGTTGAGGAG^ 

naip . 8 GCTTGTCATTGTTGAGGAGCATCCACTTCCCAATACGAGGAAATAAG^ 

3030 3040 3050 3060 3070 3080 



Fig. 5F 
3190 3200 3210 3220 3230 3240

naip-o CACATTTTTCACTTCTGGAAACATGTTTTGACAAATCACAGGTGCCAACTATAGATCAGG 

:::::::::::: : : : : j!ji::mi:i:iJUU«8iu:iii::tiw«u»?ni 
naip.6 CACATTTTTCAGTTCTCGAAACATCTTTTGXCA^TCACAGGTGCCAACTATAGAl^AGG 

3090 3100 3110 3120 3130 3140 

3250 3260 3270 32B0 3290 3300 

naip-O ACTATGCTTCTGCCTrTGAACCTATGAATGAATGG^ 

• • •*•••« w »•*•■•••-■ ♦ • • * l"5 ■ .• • S $ • • S t 5 155S ••••••■■»■ • ♦ 

naip.s ^ATOCWCTOCCTTTGJUra 

3150 3160 3170 3180 3190 3200 

3310 3320 3330 3340 3350 3360 

naip-o ATAATGTAAAGAGCTATATGGATATGCAGCGCJtf^^ 

naip.s ATAATCTAAAGAGCTATAtt^^ 

3210 3220 3230 3240 3250 3260 

3370 3380 3390 3400 3410 3420 

naip-o ATTGGAAACTTTCTCCAAAGCAGTACAAGATTTC^ 

:: - . s j i j ::::::::::: : it- : s ::::::: : ! i tt * » :::::: : 

na ip . 8 ACTGGAAACirrCTCCAAAGCAGTACAAGATTCCCTGTCTAGAAGTCfiATGTGaATQ&TA 

3270 3280 3290 3300 3310 3320 

3430 3440 3450 3460 3470 3480 

naip-O TTGATGTTGTAGGC CAGGATATGCTTGAGATTC TAATGACAGTTTTCTCAGCTTCACAGC 

::!::i:iit<ti!:t::::'.:::::::M::>::tt:::: tint ««i is: 

naip . S TTGATGTTGTAGCKCAGGATATGCTTGAGATTCTAATGACAOTTTTCTCAGCTTCACAGC 
3330 3340 3350 3360 3370 3380 

3490 3500 3510 3520 3530 3540 

naip-o GCATCGAACTC CA'n^ AAACCACAGCAGAGGCTTTATAGAAAGCATCCGCCCAGCTCTTC 

: s : : i 1 1 1 1 : : : J : s s t x t : : s : : : : : i 1 i J * ' «"* 8 3 8 1 8 8 8 8 8 8 1 8 : 8 : 5 8 * : * 1 * 8 * 
naip.s GCATCGAACTCCATTTAAACCACAGCAGAfiGCTTTATAGAAAGCATCCGCCCAGCTCTTG 

3390 3400 3410 3420 3430 3440 

3550 3560 3570 3580 3590 3600 

naip-o AGCTGTCTAAOGCCTCTCTCACCAAGTGCTC CATAAGCAAGTTGGAACTCAGCGCAGCCG 

till : ; : : : : : s j : : : : i : ; : 8 « 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 1 8 8 8 8 
naip.s AGCTCT^AAGGCCTCTGTCACCAAGTGCTC^ 

3450 3460 3470 3480 3490 3500 

3610 3620 3630 3640 3650 3660 

naip-o AACAGGAAC TGCTTCTCACCCTGCCTTC CCTGGAATCTCTTGAAGTCTCAGGGACAATCC 

: : : : : j : s : : : : s : : t : : : : : : : : : : : : : : s : 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 : 
naip . s AACAGGAACTGCTTCTCACCCTGCCTTCCCTGGAATCTCTTGAAGTCTCAGG6ACAATCC 
3510 3520 3530 3540 3550 3560 

3670 3680 3690 3700 3710 3720 

naip-o AGTCACAAGACCAAATCTTTCCTAATCT^ 

naip . s AG^CAA^CCAAATCTiTCCT^ 
,, ' :! 3570 3580 3590 3600 3610 3620 

Fig. 5G 
3730 3740 3750 3760 3770 3780

naip-o TGGATCTGGAGGGCMTATAAATGTTTTTT^ 

::::::::::::::::::::::::::::::::::::::::: .:.»■« : : ; M ::::::::: 
naip/B TGGATCTCGAGGGCAATATAAATGTTTTTTCAGTCATTCC 

3630 3640 3650 3660 3670 3680 

3790 3800 3810 3820 3830 3840 

naip-o ACCATATGGMA^W 

: : : : : : ; : : : : :::::::::::::::::: V; ; : : : 5 : : :i : i s : ; it ::::::::::::: 
naip . 6 ACCATATC^GAJ^TTAra 

3690 3700 3710 3720 3730 3740 



naip.B AATTAATTCAAAATTCTCCAAAC^ 

3750 3760 3770 3780 3790 3800 



naip-o 



naip/s ATTTTGGGTCTCTCATGACTATGCTTGTTTCCT 

3810 3820 3830 3840 3850 3B60 

3840 3850 3860 

naip-q — TGCCAGTTOKCAAATraTATTTCTCTGA 

: : : t : : : : its :::::: : : : : : 
naip.S CGGATTCATTTITTCAAGCCGTCCCATTTGTTGC^^ 

3870 3880 3890 3900 3910 3920 

3870 3880 3890 3900 3910 3920 

naip-o AGATATTAAATC TTGAAGGCCAGCAATITCCTGATGAGGAAAGATC^ 

: :: : 5: : i: t ; ::::::::: :: : : : s: : s : : : : : v: 
naip.B AGATATTAAATCTTGAAGGCCAGGAATrrCC 

3930 3940 3950 3960 3970 3980 

3930 3940 3950 3960 3970 3980 

naip-o ACATTTTAGGTTCTCTTACT 

: : : : : 5 s s : it : : : : 1 : : 1 i : : 1 5 : $ 1 : s s : 1 s j i 1 1 1 1 1 1 it : 1 1 : : s r t : : : : * : tj: 
naip . £ ACATTTTAGGTTCTCTTAGTAACCTC^^ 

3990 4000 4010 4020 4030 4040 

3990 4000 4010 4020 4030 4040 

naip-0 ATCGAGTGGCCAAACTGATCATCCAGC^ 

na ip . 8 ATC^GTC^<^J^CT(^ 

4050 4060 4070 4080 4090 4100 

4050 4060 4070 4080 4090 4100 

naip-0 CATTTTTCAAGACTTTGAATGAT^ 

: :: : : s 2 2 :::::: : : : : : j : x x : : : : s : : : : : : ; it m : : t 

naip.S CATTTTTCAAGACTTTGAATGATGACAGCGTGGTO 

4110 4120 4130 4140 4150 4160 
4110 4120 4130 4140 4150

M ip-o -_--TCTGCAG^CAC-AGGACOT---GTC 

naip.s fikGoirtcS^^ 

4170 4180 4190 4200 4210 4220 



42.01/ «jlov " — " ^ 

naip-O GAGTT- GACAGTCCCATGGCATACTCTTCCA- ATGGCAAAGT GAAT GACAAGC 

:: : t : ; ::s : : »:s :::: : »» » 
naip.s GATACAGAAATTTCTTTCAAGCACTGGACAACATGCCAAACTTG^^ 

4230 4240 4250 4260 4270 4280 

4210 4220 4230 4240 : 

naiD~o GGTTTTATCCAGAGTCTTCCTA - - - TAAATCCACGCCGGT- - --TCCTGAAGT 



naip.s CCAGGCATTCACAGAG^ 

F 4300 4310 4320 4330 4340 



4290 4300 4310 

4250 4260 4270 4280 4290 

naip-O --GGTTCAGGAGCTTCCA TTA-ACTTCGCCTGTGGA TGACTTCAGGCAGCC 

• • • :: :t: :::::» t s*: : ss :» " 

naip.s GTOG^ACGA^ajC^^ 

4300 4310 4320 4330 4340 

M i«-o TC-GTTACAGCAGCG- — - -GlXMTAACTTTGAGACACCPrCAAAAAGAGCAC 

v ... ... , ii it .» t s s » : s s s « :« »: 

naip B GATGATATTGCATTGCTTAATGTCATGAAAGAAAGACATCCTCAATCTAAGTA^ 

4410 4420 4430 4440 4450 44€0 

4350 4360 . 4370 4380 4390 

naip-O CTGCA — AAGGGA-AGAGCAGGAAGGTCAAAGAGAACAGAGC AAGAT-CA-CTA 

.. ::: : s: : ?: «*« ! :{,J! M 5,: 

naip . S ATTCTCCAGAAATGGATACTGCCGTTC TCTC CAATCATTCAGAAATAAAAGATTCAGCTA 
4470 4480 4490 4500 4510 4520 

4400 4410 4420 4430 4440 

naip-o TGAGA- -CAGACTACACAACTGGCGGCGAGTCCTGT - GATGAGCTGGAGGAGGAC -TGGA 
v . . .. . : s: : j i t : : * « s « 1 ' ; 1 s: 

naip.s aaaactgctgUtcaataatttctcttggggcatat^ 

4530 4540 4550 4560 4570 4580 



naip-o TCAGGG AATATCCACC- -TATCACTTCAGAT- - - -CA- ACAAAGACAAC 

naip.s ITAATGCTAAAAACCAAATTATC CLAAAATTATTTTATTAAATATTG<^TACAAAAGAAAA 
' * <-rst\ AC*n A fin 4630 4540 



4490 4500 4510 4520 4530 

n»in-o TCT — — " -ACAAGAGGAATTTTGACACTGGCCTACAGGAATACAAG— 

Fig. 5I
4540 4550 4560

AGCTTAC AATCAGAAC — TTGA - TGAG- -ATCAA TA 

: : : : t : : : : : : : : : : : 2 : : : : : : : : : 

AGCTCAAGAAATAAATCATCACCAATAC C TTTGAGGTCCCTGAGTAATC CACCCCAGC TA 
4710 4720 4730 4740 4750 4760 

4570 4580 4590 . 4600 

naip-O AAG- - - AACTCTCCCGTTTGG- - -ATAAAGAA- TTGGATGACTATAGAGAA G 

::: ::::::: : : :::::: :: : 

naip.fi AAGGCAAACCCTTCAATCAAGTTTATACAGCAA^ 

4770 4780 4790 4800 4810 4820 

4610 4620 4630 4640 4650 4660 

jjaip-o AAAGTGAAGAGTACATGGCTGCTGCTG - ATGAATA- - -CAATAGACTGAAGCA- -AGTGA 
: : : : :: : ::: : :::: : ::. s s ti i : mis : i i : 
jiaip . 8 AAGGGGTTCGGGACAGGTCTGCCAATCTATCTAAAAGCCA 

4830 4840 4850 4860 4870 4880 

4670 46B0 4690 4700 

naip-o AGGGATCTGC - AGATTACAAAAGTAA- — GAAGAATCA -TTGCAAGCA G 

:::::::::::::::: I:::::::::::::::::::: 
naip.B ATITATATAATAAATGGCTAACTTAACGGTrGAATCACTTTC^ 

4890 4900 4910 4920 4930 4940 

4710 4720 4730 4740 
naip-O TTAAACAGCAAATTGTCACACATC — — AAGAAGATGGT ■ TGGA 

■■*!*• 3 1 2 • SZISJSJ5S * 5 • 

naip.8 CTT AACACAGGATC CACATGAATCSTCTGTGGGCCAAGA - GATGTTCCTT AATCCT7TGTA 
4950 4960 4970 4980 4990 5000 

4750 4760 4770 

naip-O GA -CTAT— GA--TAG ACAGAA AACATAGAAGGC—TGA 

:: :::: ;:: :::: : :t ? : t 

naip.s GAACCTGTTTTCTATATTGAACTAGC1TTGG TACAGTAjGAGTTAACTOACTTTCCATTTA 
5010 5020 5030 5040 5050 5060 

4780 4790 4800 4810 4820 

jiaip. o T GCCAAGTTGTTTGAGAAA TTAAGTATC'—TGACATCTCTGCAAT- -CT 

: :::::: : ; s : : :: : : : : : : : i s : : : : 

naip.fi TCCACTGCCAATATAAAGAGGAAACAGGGGTTAGGGAAAAATGACTT^ 

5070 5080 5090 5100 5110 5120 

4830 4840 4850 4860 4870 

TCTCAGAAGGCAA- — ATG ACTTTGGACCATAACCCC GGAAGCCAAACCTCTGTGA 

::::::: ::: :tt :::::: : : s:: :;: : iz z: 
TCTCAGAG TTCAACATATGCTATAATTTAGAATTTT - CTTATGAATCCACTCTACT- TGG 
5130 5140 5150 5160 5170 5180 

4880 4890 4900 4910 4920 

naip-O GCATCACAGTTTTGGT TGCTTTAATATCAT — CAGTATTGAAGCATTTTATAA- 

i • « a i • « • • * m m •••••• • t m m •• •••*■•« 

naip . s GTAGAAAATATTTTATC TCTAGTGATTGCAT ATTATTTCCATATCATAGTATTTCATAGT 
5190 5200 5210 5220 5230 5240 

Fig. 5J 
naip, 6
naip.s
. 0 , n 4*40 4950 4960
a.ip-0 *^m»» ATCAAC 1«*CTG« «CTCC«T 

4970 4980. «M 5000 , 

naip-o TAAGGA-TTTTATG— CTTTAAA--CATTGG-«-TTCTTC-TATTA--AG^----"" 

5010 5020 

Baip-O TGAA ATACTGTT TGAGGTTTTT-------AAG r 

naip.s TA*4~^ 

5030 5040 5050 5 060 
.„ „ .ccro — AAA GGAAGGT TCTGGTGTGAACTAAACTTTC A 

»ip-o CK 5 cSL»CG«-tIS^-T»C«---^™x- --.--TTTOTTIOCXTJ. 



5130 

5110 sa^v 
naip-O GGTGATC---TCATTT--- AAT 



5110 5120 ^ — -AACCA 



naip-o w^v-.wu.. ---- tit t 

_.. fl 5150 5160 517 0 

^aip-o CcSSaGATAAC TCTT ATITAT AATCACTTTTTTCCA- - - 

naip.B ^TCTCTACTAA^ 

" 56X0 5620 5630 5640 5650 5660 

5180 5190 5200 5210 

naip-O CATAAGG .««.—-— r - AAACTGGGTT CCTGCAATGAAGTCTCTGAAGTGAA- 

naip-s c^yCTAGGGCGGCTGAGACGCAAGAC^^ 

v 5570 5680 5690 5700 5710 5720 

5220 5230 5240 
naip-O - --ACTGC-ITGTTTCCT AGCAC- ACACTITTGOTT- 



naip-o ------ 

Fig. 5K 
5250 5260 5270 5280 5290

naip-c AAGTCTGTTTTATC^CTTCATT^ -TAC - -AG 

: : : : : : : : : : s ; : : s :::::: : : : : : : 
na ip . 6 CAAAAAACAAAAC CACTTATATTGCT AGCTACATT AAGAAWTCTGAATATGTTA 

5790 5800 5810 5820 5830 5840 

5300 5310 5320 5330 

na ip~0 CTA-CTCCTC- — CC ^TACCGCCACCTCCACAGACACGACTCTCCTGGT— - 

j : : : : : : t; : : : 5 :::::: : : x : 

naip.S CTTGCTTGTGGTAACCATTTATAATATCAGAAAGTATATGTACACCAAAA-- CATGTTGAA 
5850 5860 5870 5880 5890 5900 

5340 5350 5360 
naip-o - -TCCATCTCCT-CTGCTGC TTCTAGCTCC • • • CTGC 

:::::: : : ::: M it :: is: 

naip.fi CATCCATGTTCTACAACTGAAATATAAATAArrrT^ 

5910 5920 5930 5940 5950 5960 

5370 5380 5390 5400 

- — TCTGGC-- TTCA AGGTGCGCAGGACCTGCTTCCOTG — GTGA 

2?;:: : : r : : : : :s : : :: 

naip ; E AAAAAAATTTCTGGAAGTTTATATCTAAA^ 

5970 5980 5990 6000 6010 6020 

5410 5420 5430 5440 5450 5460 

naip-O TCCTCTGTAGTCTCCOLCACCC^ 

::: : :: ; t n :::::::: lit a i t ::::: 
naip.B GCCTG - GAAGCCATTCTTAC TTTTCAGTCTCTC CCATTCTGTACTGTTTTTTGTTTTACT 
6030 6040 6050 6060 6070 

5470 5480 5490 5500 

naip-O TCT- r -CCAGC - TCAGACCTCTCCATCAATCCCAACGCA TA -CAC- 

: :::::: it: in: it : 1 1 : : : 

naip.e TTCGTGCCTGCATTATTTTTCTATTTAAAACAAAAATA^ 

6080 6090 6100 6110 6120 6130 



Elapsed time: 0:01:38 

Fig. 5L 
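
The header of the alignment in Fig. 5 reports a single summary figure, 77.8% identity, for the two sequences. As a rough illustration of what such a figure measures, the sketch below counts identical positions over the aligned columns of two gapped sequences. The function name `percent_identity`, the use of `-` as the gap character, and the choice of denominator (all aligned columns, gaps included) are assumptions made for this example; the program that produced Fig. 5 is not identified here and may define identity differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the aligned columns of two gapped sequences.

    Assumption: both inputs are rows of the same pairwise alignment, so they
    have equal length, and `gap` marks an inserted gap in either row.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    columns = 0   # columns with a residue in at least one row
    matches = 0   # columns where both rows carry the same residue

    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # a column that is a gap in both rows carries no information
        columns += 1
        if a == b and a != gap:
            matches += 1

    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


# Toy rows, not taken from Fig. 5: one gap column and one mismatch in ten columns.
top = "ACGT-ACGTA"
bottom = "ACGTTACCTA"
print(round(percent_identity(top, bottom), 1))  # 80.0
```

Counting gap columns in the denominator gives a conservative figure; dividing by the number of ungapped columns instead would give a higher value for the same alignment, which is one reason identity percentages from different tools are not always directly comparable.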
ACAAAAGGTCCTGTGCTCACCTGOTACCCTTCTG6ACGTTGCCCTGTGTACCTCTTCCAC
. « /-.^m^^t > p^rririr hTCGA(5AAGCTG 

TGTTTTC 



1 "rCCAGGACACGAGTGGACCCTGGGAAGACCTGCAACGGGACACATGGAGAAGCTG 



TGCCTGTTGATcVcGACGAACCCCGGGTATTGACCCCAGACAACAATGCCACTTCATAT 

fil .._.+—.——+— + ♦ ----+---.——+ 120 

ACGGAC AAGTAGAjTGCTGC TTGGGGC CCATAACTGGGGTCTGTTGTTACGGTGAAGTATA 

TGGGGACTTCGTCTGGGATTCCAAGGTGCATTCATTGCAAA^Hrc^ ^ ^ 

121 ACCCCTGAAGCAGACCCTAAGGn-CCACGTAACTAACGl 




.TTTATAAAAGAG 



ACTGCTTCCTACTAAAGGACGGACAGAGCATTTGTTCTTCAGCCACATACTTTCCTTCCA 

181 TGACGAAGGAT GATT TC C TGCCTGT CTC G TAAAC AAG AAGT C GGTGTATGAAAGGAAGGT 
CTGGCCAGCATTCTCCTCTATTAGACTAGAACTGTGGATAAACCTCAWAAATGGCCACC 



oil , 

* * GACCGGTCGTAAGAGGAGATAATCTGATCTTGACACCTATTTGGAG^^TriTACCGGTGG ^ 



CAGCAGAAAGCCTCTGACGAGAGGATCTCCCAGTMGATCACAATTTGCTGCCAGAGCTG 

301 GTCGTCTWCGGAGACTGCTCTCCTAGAGGGTCAAACTAGIGTTAAACGACGOTCTCGAC 
4 Q Q K A S D E R I SQFDHMLI.FEL 23 

TCTGCTCTTCTGGGCCTAGATGCAGTTCAGTTGGCAAXGGAACTAGAAGAAGAGGAGCAG 

36i + ♦ ♦ ♦ 420 

AGACGAGAAGACCCGGATCTACGTCAAGTCAACCGTTTCCTTGATCTTCTTCTCCTCGTC 

24 S A L L G I. D A V Q L A K E I* E E E B Q 43 

AAGGAGCGAGCAAAAATGCAGAAAGGCTACAACTCTCAAATGCGCAGTGAAGCAAAAAGG 

421 +- + + -- ♦ "" * 480 

TTCCTCGCTCGTTTTTACGTCTTTCCGATGTTGAGAGTTTACGCGTCACTTCGTTTTTCC 

44K ERAKM QKGVNSQMR8E AKR 63 

TTAAAGACTTTTGTGACTTATGAGCCGTACAGCTCATGGATACCACAGGAGATGGCGGCC 

461 4 ......— — — ♦ — ♦ + — ♦ 540 

AATTTCTGAAAACACTGAATACTCGGCATGTCGAGTACCTATGGTGTCCTCTACCGCCGG 

64LK TFV TY EPY S 8 W I ?QEMAA 83 

GCTGGGTTTTACTTCACTGGGGTAAAATCTGGGATTCAGTGCTTCTGCTGTAGCC!TAATC 

541 + ---+ + : ♦ + 600 

CGACCCAAAATGAAGTGACCCCATTTTAGACCCTAAGTCACGAAGACGACATCGGATTAG 

84 A 0 T 1 F T G V K S G I Q C F C C S L I 103 

CTCTTTGGTGCCGGCCTCACGAGACTCCCCATAGAAGACCACAAGAGGTTTCATCCAGAT 
em - - ----+- — — « - - — 4 660 

GAGAAACCACGGCCGGAGTGCTCTGAGGGGTATCTTCTGGTGTTCTCCAAAGTAGGTCTA 
104LPG AGLTRLP IEDHKB FHPD 123 



Fig. 6A 
TGTGGGTTCCTTTTGAACXAGGATGTTGGTAACATTGCCAAGTACGACATAXOGOTOAAG

661 + - — + + ♦ ♦ + 720 

ACACCCAAGGAAAACTTGTTCCTACAACCATTCTAACGGTTCATGCTGTATTCCCACTTC 

124 COr LLNK DVGHIAKYD I R V K 143 

AATCTGAAGAGCAGGCTGAGAGGAGGTAAAATGAGGTACCAAGAAGAGGAGGCTAGACTT 

_ 2 - __- + . - + — ■♦.—*-«•'-?*" * -+ "80 

TTAGACTTCTCGTCCGACTCTCCTCCATTTTACTCCATGGTTCTTCTCCTCCGATCTGAA 
144 N L K 8 R L R 0 GKMRTQEEEARL 163 

GCGTCCTTCAGGAACTGGCCATTTTATGTCCAAGGGATATCCCCrtGTGTOCTCTCAGAG 

7B1 -+ + — ♦ 840 

CGCAGGAAGTCCTTGACCGGTAAAATACAGGTTCCCTATAGGGGAACACACGAGAGTCTC 
164 AS P R N W P F T V Q 0 Z S P CVL 8 E 183 

GC TGGCTTTGTC TTT ACAG^GTAAAC AGGACACGG TAC AGT GTT TTTC CTGTGGTGGATGT 
ea\ +- — ♦ — — ----+ .. — --+ 900 

C G ACC G AAAC A G AAA TGTQC ATTTGTCCTGT GC C ATGTCACAAAAAGGACAC CAC CT AC A 
184 A C P V P T G^K Q D I V Q C P S C G G C 203 



TTAi 



GGAAATTTCGAAGAAGGAGATGATCCTTGGAAGGAACATGCCAAATGGTTCCCCJ 



960 



AATCCTTTAAGCCTTCTTCCTCTACTAGGAACCTTCCTTGTACGGTTTACCAAGGGG1 
204 L GNWEEGDDPWKE H A K M F P K 223 

TGTGAATTTCTTCGGAGTAAGAAATCCTCAGAGGAAATTACCCAGTATATTCAAAGCTAC 
561 - — «~--"- — - + 

ACACTTAAAGAAGCCTCATTCTMAGCACTCTCCTMAATGGGTCATATAAGTITCGATG 
224 C B P L R 8 K K S 8 E E I * Q 1f X « S Y 243 

aa'gggatttgttgacataacg^gagaacattttgtgaattcctgggtccagagagaatta 

WCCCTAAACAACTGTATTGCfcCTCTTGTAAAACACTTAAGGACCCACGTCTCTCTTAAT 
244 K G P V D I T ^ E H P V N S W V Q R E L 263 

CCTATGGCATC AGCTTATTGCAATGAC AGCATCTTTGCTTACGAAGAACTACGGCTGGAC ^ 

1081 GGATACCGTAGTCGAATAAC^W 
264 P H A 8 A YCHDSIP ATfEELRLD 283 

1CTTTT AAGGACTGGCCCCGGGAATCAGCTGTGGGACTTGCAGCACTGGCC AAAGCAGGT 

1141 — + ♦ ---- — * * + 1200 

AGAAAATTCCTGACCGGGGCCCTTAGTCGACACCCTCAACGTCGTGACCGGTTTCGTCCA 
284 S P K D W P R B 8 A V 0 V A A L A K A G 303 

CTTTTCTACACA^TATAAAGGACATCGTCCAGTGCTTTTCCTGTGGAGGGTGTTTAGAG 
1 001 --L---~-if — — 1260 

gaaaagatgtgtc£atatttcctgtagcaggtcacgaaaaggacacctcccacaaatctc 

304 LPYTO % IKDIVQCrSCG QCL E 323 
910 

AAATGGCAGGAAGGTGATGACCCATTAGAC(»TCACACCAGATGTTTTCCCAATTGTCCA 

1261 ---« - + _ ^ - - — ----«f u 

TTTACCGTCCTTCCACTACTGGGTAATCTGCTAGTGTGGTCTACAAAAGGGTTAACAGGT 
324 K W O E G D D P L D D H T R C F P N C P 343 
rTTCTCCAAAATATGAAGTCCTCTGCWAAGTGACTCCAGACC^



1321 aaTwmmka^ 

344 P t Q K M K S S A E V T P P L Q 8 R G E 

CTTTGTGAATTACTG^AAACCACAAGTGAAAGCAATCTTfiAACA ^ 
„___-_--+-----(-- + GTGTTCAC ^ CGTTAG ^ CTTCTAAGT » rATCG icAACCA 



CTTTGTGAATTAC " 

1381 

364 L C 



GAAACACTTAATGACfcTTTGGTGTTCACTTTCGTTAtaAiiv.i it- iju«« ^«*v.»* www* 
E li t -TT T T S E S M - E D S I A V G 



383 



CCTATAGTGCCA^TGGCACAGGGTGAAGCCCAGTGGTTT^GAGG^GAATCTC 

1441 cgaVaVcaVggV^ 

384 P 1 V P E^M A Q G E A Q H F Q E A K H L 403 

AATCyVGCAGCTGAGAKAGCWATACCAGCGCCAGWTC 15g0 

1501 ttaVtcgVcgactctcg^^^^ ._, 

404 N E Q L R A A T T S A S F R B M S I L D 

ATC tcttccgatctggccacggaccacttgc^^ 
1561 Vagagaaggcta^ 113 

424 1SSDLA TDHLLGCDL S I ASK 443 

cacatcagcaaacctgtgcaa<»acctctgg^ i68o 
1621 wtactcSttg-^ 463 

444 H I S K P V Q E P L V L P E V P G » L H 

rsmr^ »/?r T^-»»RTGCAAAGACGGTCCTCCTGAAGAAAAT 

+ 1740 



TCTGTCATGTGTGTGGAGGGTGAAGCTGGAAGTGGAAAGACGGTCCTCCTGAAGAAAATA 



1681 agacagtIacacacacctc AB3 

464SVMC VEGEAG80KTVLwK K1 4B3 

<kttttctgtgggcatctggatgctgtcccctgttaaacaggttccagctggw iaoo 
1741 VgaaaagacaccVg^ 503 

484 A F I. W A S C C C P L L M R P Q L V F Y 503 

ctctcccttagttccaccagaccagacgaggggctggccagtat^tctgtgaccagctc i86q 

1801 GAGAGGGAATCAAGGTGCT „, 
504 L S L 8 S T R P D E 0 L A S 1 " ! 

CTAGAGAAAGAAGGATCTGTTACTGAAATGTGCATGAGGAACATTATCCA^^ ^ 

1861 GATCTCTCTCWCCTAGACA^ 
524 L E KEGS V»E MCMRHII«Q- K 543 



!•:* 544 



XATCAGGTCTTA7TCCTTTTAGATGACTACAAAGAAATATGTTCAATCCCTCAAGTCATA ^ 

" T TAGTCw\GMT^ «- 
HQV LPLLODYKEICS1PQV1 563 

Fig. 6C 

WO 97/26331 

GGAAAACTGATTCAAAAAAACCACTTATCCCGGACCTGCCTATTGATTGCTGTCCGTACA 

l$gl + 4-- 4- ' 4—-- • 4 2040 

CCTTTTGACTAAGTTTTTTTGGTGAATAGGGCCTGGACGGATAACTAACGAC AGGCATGT 
564 G K L I Q K N H L S R T C L L I A V R T 583 

AACAGGGCCAGGGACATCCGCCGATACCTAGAGACCATTCTAGAGATCAAAGCATTTCCC 
2041 ------- — 2100 

TTGTCCCGGTCCCTGTAGGCGGCTATGGATCTCTGGTAAGATCTCTAGTTTCGTAAAGGG 
584 H R A R D I R R Y L E T 1 L E 1 K A F P €03 

TTTTATAATACTGTCTGTATATTACGGAAGCTCTTTTCACATAATATGACTCGTCTGCGA 
2101 — - — ----4*---*~---+-. t --t-v---* — — 4w rr *_- ±„+ w „~± mwm m~+ 2160 
AAAATATTATGACAGACATATAATGCCTTCGAGAAAAGTGTATTATACTGAGCAGACGCT 
604 F Y R T V C I L R K L F S H N K T R L R 623 

AAGTTTATGGTTTACTTTGGAAAGAACCAAAGTTTGCAGAAGATACAGAAAACTCCTCTC 

2161 — ------ — — 4— + — 2220 

TTCAAATACCAAATGAAACCTTTCTTGGTTTCAAACGTCTTCTATGTCTTTTGAGGAGAG 
624 KFMV YFG XNQ S L Q K 1 Q K T P L 643 

TTTGTGGCGGCGATCTGTGCTCATTGGTTTCAGTATCCTTTTGACCCATCCTTTGATGAT 

2221 -- — * — ~+- — 4-*~ -4--^-- — + 2280 

AAACACCGCCGCTAGACACGAGTAACCAAAGTCATAGGAAAAGTGGGTAGGAAACTACTA 
644 F V A A I C A H W F Q Y P F D P S F D D 663 

GTGGCTGTTTTCAAGrrCCTATATGGAACGCCTTTCCTTAAGGAACAAAGCGACAGCTGAA 

2281 — - — -■ ^-^-^4-------- -4------ 2340 

CAC CG ACAAAAGT TCA GGAT AT ACCT TGC G GAAAGGAATTCCTT GTTTCGCTGTCGACTT 
664 V A. V F K S Y M E R L S L R N K A T A E 683 

ATTCTCAAAGCAACTGTGTCCTCCTGTGGTGAGCTGGCCTTGAAAGGGTTTTTTTCATGT

2341 ---------+---------+---------+---------+---------+---------+ 2400

TAAGAGTTTCGTTGACACAGGAGGACACCACTCGACCGGAACTTTCCCAAAAAAAGTACA
684 I L K A T V S S C G E L A L K G F F S C 703

TGCTTTGAGTTTAATGATGATGATCTCGCAGAAGCAGGGGTTGATGAAGATGAAGATCTA 

2401 ---------+---------+---------+---------+---------+---------+ 2460

ACGAAACTCAAATTACTACTACTAGAGCGTCTTCGTCCCCAACTACTTCTACTTCTAGAT
704 C F E F N D D D L A E A G V D E D E D L 723

ACCATGTGCTTGATGAGCAAATTTACAGCCCAGAGACTAAGACCATTCTACCGGTTTTTA 

2461 ---------+---------+---------+---------+---------+---------+ 2520

TGGTACACGAACTACTCGTTTAAATGTCGGGTCTCTGATTCTGGTAAGATGGCCAAAAAT
724 T M C L M S K F T A Q R L R P F Y R F L 743

AGTCCTGCCTTCCAAGAATTTCTTGCGGGGATGAGGCTGATTGAACTCCTGGATTCAGAT 

2521 ---------+---------+---------+---------+---------+---------+ 2580

TCAGGACGGAAGGTTCTTAAAGAACGCCCCTACTCCGACTAACTTGAGGACCTAAGTCTA
744 S P A F Q E F L A G M R L I E L L D S D 763

AGGCAGGAACATCAAGATTTGGGACTGTATCATTTGAAACAAATCAACTCACCCATGATG

2581 ---------+---------+---------+---------+---------+---------+ 2640

TCCGTCCTTGTAGTTCTAAACCCTGACATAGTAAACTTTGTTTAGTTGAGTGGGTACTAC
764 R Q E H Q D L G L Y H L K Q I N S P M M 783
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KTCTAAGCGCCTACAACAATTTTTTGAACTATCTCT^ 



2U1 T^C A^CKG^TGTTGTTAAAAAAC TTCATAC AGAGGTC GGAGGGAAGTTGTTTTCGT 
784 T V S A Y H N F L N Y V S S L P S I K A 803 

GGGCCCAAAATTGTGTCTCATTTGCTCCATTTAGTGGATAACAAAGAGT^TTMAGAAT ^ ^ ^ 



2701 ttVc<*?TTT^ 

804 G P K I V S H L L H L V D N K E S L E N 823

atatctgaaaatgatgactacttaaagcaccagccagaaatttcactgcagatgcagtta 

TAW 
824 I S 

2821 



2761 Va'tagaVot^ „ , m 

I SENDDYLKHQ P E I S L 0 M I 843 

CTTAGGGGATTGTGGCAAATTTGTCCACAAGCTTACTTTTCAATGGTTTCAGAACATTTA 



GAATCCCCTAACACCGTTTAAACAGGTGTTCGAATGAAAAGTTACCAAAGTCTTGTAAAT 
844 L R G L W Q I C P Q A Y F S M V S E H L 863

CTWTTCTTGCCCTGAAAACTGCTTATCAAAGC^ 



GACCAAGAACGGGACTTTTGACGAATAGTTTCGTTGTGACAACGACGCACAAGAGGTAAA
LVIALKTAY Q S H T V A A C S P P 883 



! » : ! 



2881 
864 L 

GTTTTGCAATTC^^ 

CAAAACGTTAAGGAAGTTCCCTCTTGTGACTGAAACCCACGCGAATTGAATGTCATGAAA
884 V L Q F L Q G R T L T L G A L N L Q Y F 903

TTCGACCACCCAGAAAGCTTGTCArrGTTGAGGAGCATCCACTTCCCAATACCAGGAAAT 

3001 ^GCTMT^GTC^ a ,, 
904 rDHPBSL5 LI.RSI HPPIR GS 523 

AAGACATCACCCAGAGCACATTOTTCAGTTCTGGAAACATGTTTTGACAAATCA^WTG 

3061 wctctagtggVtc^ct^ 4 . , 

924 K T S P B A H F S V L E T C P D K S Q V 943 

ccaactatagatcaggactatgcttctgcctttgaacctatgaatgaatgggagcgaaat 

3121 ---------+---------+---------+---------+---------+---------+ 3180

ggttgatatctagtcctgatacgaagacggaaacttggatacttacttaccctcgcttta

944 P T I D Q D Y A S A F E P M N E W E R N 963

ttagctgaaaaagaggataatgtaaagagctatatggatatgcagcgcagggcatcacca
aatcgactttttctcctattacatttctcgatatacctatacgtcgcgtcccgtagtggt

964 L A E K E D N V K S Y M D M Q R R A S P 983

gaccttagtactggctattggaaactttctccaaagcagtacaagattccctgtctagaa 

3241 ---------+---------+---------+---------+---------+---------+ 3300

CTGGAATCATGACCGATAACCTTTGAAAGAGGTTTCGTCATGTTCTAAGGGACAGATCTT
984 D L S T G Y W K L S P K Q Y K I P C L E 1003
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GTCGATGTGAATGATATTGATGTTGTAGGC CAGGATATGCTf GAGATTC TAATGACAGTT 

3301 ---------+---------+---------+---------+---------+---------+ 3360

CAGCTACACTW^ 

1004 V D V N D I D V V G Q D M L E I L M T V 1023

TTCTCAGCTTCACAGCGCATCGAACTCCATTTAAACCACAGCAGAGGCTTTATAGAAAGC
3361 ---------+---------+---------+---------+---------+---------+ 3420

AAGAGTCGAAGTGTCGCGTAGCTTGAGGTAAATTTGGTGTCGTCTCCGAAATATCTTTCG
1024 F S A S Q R I E L H L N H S R G F I E S 1043

ATCCGCCCAGCTCTTGAGCTGTCTAAGGCCTCTGTCACCAAGTGCTCCATAAGCAAGTTG

3421 ---------+---------+---------+---------+---------+---------+ 3480

TAGGCGGGTCGAGAACTCGACAGATTCCGGAGACAGTGGTTCACGAGGTATTCGTTCAAC
1044 I R P A L E L S K A S V T K C S I S K L 1063 

GAACTCAGCGCAGCCGAACAGGAACTGCTTCTCACCCTGCCTTCCCTGGAATCTCTTGAA 

3481 ---------+---------+---------+---------+---------+---------+ 3540

CTTGAGTCGCGTCGGCTTGTCCTTGACGAAGAGTGGGACGGAAGGGACCTTAGAGAACTT
1064 E L S A A E Q E L L L T L P S L E S L E 1083

GTCTCAGGGACAATCCAGTCACAAGACCAAATCTTTCCTAATCTGGATAAGTTCCTGTGC 

3541 ---------+---------+---------+---------+---------+---------+ 3600

CAGAGTCCCTGTTAGGTCAGTGTTCTGGTTTAGAAAGGATTAGACCTATTCAAGGACACG
1084 V S G T I Q S Q D Q I F P N L D K F L C 1103

CTGAAAGAACTGTCTGTGGATCTGGAGGGCAATATAAATGTTTTTTCAGTCATTCCTGAA 
3601 ---------+---------+---------+---------+---------+---------+ 3660

GACTTTCTTGACAGACACCTAGACCTCCCGTTATATTTACAAAAAAGTCAGTAAGGACTT
1104 L K E L S V D L E G N I N V F S V I P E 1123

GAATTTCCAAACTTCCACCATATGGAGAAATTATTGATCCAAATTTCAGCTGAGTATGAT
3661 ---------+---------+---------+---------+---------+---------+ 3720

CTTAAAGGTTTGAAGGTGGTATACCTCTTTAATAACTAGGTTTAAAGTCGACTCATACTA
1124 E F P N F H H M E K L L I Q I S A E Y D 1143



CCTTCCAAACTAGTAAAATTAATTCAAAATTCTCCAAACCTTCATGTTTTCCATCTGAAG

3721 ---------+---------+---------+---------+---------+---------+ 3780

GGAAGGTTTCATCATTTTAATTAAGTTTTAAGAGGTTTGGAAGTACAAAAGGTAGACTTC

1144 P S K L V K L I Q N S P N L H V F H L K 1163

TGTAACTTCTTTTCGGATTTTGGGTCTCTCATGACTATGCTTGTTTCCTGTAAGAAACTC

3781 ---------+---------+---------+---------+---------+---------+ 3840

ACATTGAAGAAAAGCCTAAAACCCAGAGAGTACTGATACGAACAAAGGACATTCTTTGAG

1164 C N F F S D F G S L M T M L V S C K K L 1183

ACAGAAATTAAGTTTTCGGATTCATMTCTCAAGCCGTCCCATT ^ 
3841 ~ — ~ — *"* — ■ , ^****"" — — — 4— ————— — 4 4 .4 

TGTCTTTAATTCAAAAGCCTAAGTAAAAAAGTTCGGCAGGGTAAA9AACGGTCAAACGGT 

1184 T E1KFS D S F F Q A V P F V A S L F 1203 

AATTTTATTTCTCTGAAGATATTAAATCTTGAAGGCCAGCAATTTCCTGATGAGGAAACA 

3901 ---------+---------+---------+---------+---------+---------+ 3960

TTAAAATAAAGAGACTTCTATAATTTAGAACTTCCGGTCGTTAAAGGACTACTCCTTTGT

1204 N F I S L K I L N L E G Q Q F P D E E T 1223

Fig. 6F 
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3961 
1224 S 



TCAGAAAXMTtVcW^^ 4020 



4021 isssss^ 1263 

T G yTCC(»GTCCTCTCATTTTTCAAGACTTTG^T<iXTGA^ mo 

■ xVAGVcpcVc^ 1283 

JUUGTAGCAATCAGTGGAGGTTTCCAGAAACTTC^^^ ^ 

4141 VtVca^aV^ 1303 

AAGATTACAGAGGAAGGATACAGAAATTTCTTTCAAGCACTGGACAACATG^ (](D 

4201 VtVtaa^ 1323 

CACGAGTTGGACATCTCCAWCATTTCACAGACT 4320 



1304 K 



4261 — 

1324 Q 



««Sj5^^ 13 43 
.MGTCTTTGAGTCAATGTGTGTTACGACTACCAAGGCTCATT^ ^ 

4321 't^GAA^ 1363 
1344 KSLS QCVLRLPRI»X»f»" 

TGGCTCTTGGATGCAGATGATAWGCATTGCTTAATGTCATGAAAGA^ i<w 

4361 wVgVgaaV^ 1363 
1364 WLLOADBJALLHV MKERHPQ 

TCTAAGTACTTAACTATTCTCCAGAAATGGATACTGCCGTTC ^ 
gUTIiT-I-tO-KV 8 

TAAAAGATTCAGCTAAAAACTGCTGAATCAATAATWGTCTTGGXSGCATATTGAGGATCT 

1404 



4441 

1384 S 

rAAAAACTGCTGAATCAATAATTTGTCTTGGGGCATAITGAGGATGT 
4501 aYtTTCTAAGTCGA^^ 1423 



XAAWU^OTTGTTC^TTAATGCTAAAAACCJ^M 4620 

456i + "tVc'gaVttttggtttaataggttttaataaaataat^ 



vtgctaaaaac caaattatcc aaaattattttattaaatatt 

TCTTTTTCAACAACTAATTJ 
GCATACAAAAGAAAATGTGTAAGGCTTGCTAAAAAACAAAACAAAACAAAACACAGTCCT

4621 ---------+---------+---------+---------+---------+---------+ 4680

CGTATGTTTTCTTTTACACATTCCGAACGATTTTTTGTTTTGTTTTGTTTTGTGTCAGGA 

GCATACTCACCACCAAGCTCAAGAAATAAATCATCACCAATACCTTTGAGGTCCCTGAGT 

4681 ---------+---------+---------+---------+---------+---------+ 4740

CGTATGAGTGGTGGTTCGAGTTCTTTATTTAGTAGTGGTTATGGAAACTGCAGGGACTCA 

AATCCACCCCAGCTAAAGGCAAACCCTTCAATCAAGTTTATACAGCAAACCCTCCATTGT

4741 ---------+---------+---------+---------+---------+---------+ 4800

TTAGGTGGGGTCGATTTCCGTTTGGGAAGTTAGTTCAAATATGTCGTTTGGGAGGTAACA

CCATGGTCAACAGGGAAGGGGTTGGGGACAGGTCTGCCAATCTATCTAAAAGCCACAATA 

4801 ---------+---------+---------+---------+---------+---------+ 4860

GGTACCAGTTGTCCCTTCCCCAACCCCTGTCCAGACGGTTAGATAGATTTTCGGTGTTAT

TGGAAGAAGTATTCAATTTATATAATAAATGGCTAACTTAACGGTTGAATCACTTTCATA 

4861 ---------+---------+---------+---------+---------+---------+ 4920

ACCTTCTTCATAAGTTAAATATATTATTTACCGATTGAATTGCCAACTTAGTGAAAGTAT 

CATGGATGAAACGGGTTTAACACAGGATCCACATGAATCTTCTGTGGGCCAAGAGATGTT 

4921 ---------+---------+---------+---------+---------+---------+ 4980

GTACCTACTTTGCCCAAATTGTGTCCTAGGTGTACTTAGAAGACACCCGGTTCTCTACAA 

CCTTAATCCTTGTAGAACCTGTTTTCTATATTGAACTAGCTTTGGTACAGTAGAGTTAAC 

4981 ---------+---------+---------+---------+---------+---------+ 5040

GGAATTAGGAACATCTTGGACAAAAGATATAACTTGATCGAAACCATGTCATCTCAATTG 

TTACTTTCCATTTATCCACTGCCAATATAAAGAGGAAACAGGGGTTAGGGAAAAATGACT

5041 ---------+---------+---------+---------+---------+---------+ 5100

AATGAAAGGTAAATAGGTGACGGTTATATTTCTCCTTTGTCCCCAATCCCTTTTTACTGA 

TCATTCCAGAGGCTTCTCAGAGTTCAACATATGCTATAATTTAGAATTTTGTTATGAATC 

5101 ---------+---------+---------+---------+---------+---------+ 5160

AGTAAGGTCTCCGAAGAGTCTCAAGTTGTATACGATATTAAATCTTAAAAGAATACTTAG

CACTCTACTTGGGTAGAAAATATTTTATCTCTAGTGATTGCATATTATTTCCATATCATA

5161 ---------+---------+---------+---------+---------+---------+ 5220

GTGAGATGAACCCATCTTTTATAAAATAGAGATCACTAACGTATAATAAAGGTATAGTAT 

GTATTTCATAGTATTATATTTGATATGAGTGTCTATATCAATGTCAGTGTCCAGAATTTC 

5221 ---------+---------+---------+---------+---------+---------+ 5280

CATAAAGTATCATAATATAAACTATACTCACAGATATAGTTACAGTCACAGGTCTTAAAG 

GTTCCTACCAGTTAAGTAGTTTTCTGAACGGCCAGAAGACCATTCGAAATTCATGATACT 

5281 ---------+---------+---------+---------+---------+---------+ 5340

CAAGGATGGTCAATTCATCAAAAGACTTGCCGGTCTTCTGGTAAGCTTTAAGTACTATGA 

ACTATAAGTTGGTAAACAACCATACTTTTATCCTCATTTTTATTCTCACTAAGAAAAAAG 

5341 ---------+---------+---------+---------+---------+---------+ 5400

TGATATTCAACGATTTGTTGGTATGAAAATAGGAGTAAAAATAAGAGTGATTCTTTTTTC



Fig. 6H 






TCAACTCCCCTCCCCTTGCCCAAGTATGAAATATAGGGACAGTATGTATGGTGTGGTCTC

5401 ---------+---------+---------+---------+---------+---------+ 5460

AGTTGAGGGGAGGGGAACGGGTTCATACTTTATATCCCTGTCATACATACCACACCAGAG 



ATTTGTTTAGAAAACCACTTATGACTGGGTGCGGTGGCTCACACCTGTAATCCCAGCACT 

5461 ---------+---------+---------+---------+---------+---------+ 5520

TAAACAAATGTTTTGGTGAATACTGACCCACGCCACCGAGTGTGGACATTAGGGTCGTGA 



TTGGGAGGCTGAGGCGGGCGAATCATTTGAGGTGAGGAATTCGAGACCAGCGTGGCCAGC 

5521 ---------+---------+---------+---------+---------+---------+ 5580

AACCGTCCGACTCCGCCCGCTTAGTAAACTCCAGTCCTTAAGCTCTGGTCGGACCGGTCG



ATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGCACATGCC
TACCACTTTGGGGTAGAGATGATTTTTATGTTTTTAATCGGTCCACACCACCGTGTACGG 



TGTAGTCCCAGCCACTAGGGCGGCTGAGACGGAAGACTTGCTTGAACCCGGGAGGCAGAG

5641 ---------+---------+---------+---------+---------+---------+ 5700

ACATCAGGGTCGGTGATCCCGCCGACTCTGCGTTCTGAACGAACTTGGGCCCTCCGTCTC

GlTGCAGTGAGCCAAGATGGCGCCACTGGATTCCAGCCl'GGG^AACAGAGCAAGACCCTG 

TCTGTCTCAAAACAAAAAACAAAACCACTTATATTGCTAGCTACATTAAGAATTTCTGAA

5761 ---------+---------+---------+---------+---------+---------+ 5820

AGACAGAGTTTTGTTTTTTGTTTTGGTGAATATAACGATC 

TATGTTACTGAGCTTGCTTGTGGTAACCATTTATAATATGAGAAAGTATATGTACACCAA

5821 ---------+---------+---------+---------+---------+---------+ 5880

ATACAATGACTCGAACGAACACCATTGGTAAATATTATAGTCTTTCATATACATGTGGTT 

AACATGTTGAACATCCATGTTGTACAACTGAAATATAAATAATTTTGTCAATTATACCTA

5881 ---------+---------+---------+---------+---------+---------+ 5940

TTGTACAACTTGTAGGTACAACATGTTGACTTTATATTTATTAAAACAGTTAATATGGAT

AATAAAACTGGAAAAAAATTTCTGGAAGTTTATATCTAAAAATGTTAATAGTGCGTACCT

5941 ---------+---------+---------+---------+---------+---------+ 6000

TTATTTTGACCTTTTTTTAAAGACCTTCAAATATAGATTTTTACAATTATCACGCATGGA

CTAGGAAGTGGGCCTGGAAGCCATTCTTACTTTTCAGTCTCTCCCATTCTGTACTGTTTT 
6001 ---------+---------+---------+---------+---------+---------+ 6060

GATCCTTCACCCGGACCTTCGGTAAGAATGAAAAGTCAGAGAGGGTAAGACATGACAAAA

TTGTTTTACTTTCGTGCCTGCATTATTTTTCTATTTAAAACAAAAATAAATCTAGTTTAG

6061 ---------+---------+---------+---------+---------+---------+ 6120

AACAAAATGAAAGCACGGACGTAATAAAAAGATAAATTTTGTTTTTATTTAGATCAAATC

CACT 
6121 ---- 6124
GTGA 
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TTCCGGCTGGACGTTGCCCTGTGTACCTCTTCGA^ 

^ ^ 4 — -+ + 60 

;juggccgacctgcaacgggAc^ 



GGGTATTGACCCCAGACAACAAra^ 

~---4 - + + + + 120 

CCCATAACTGGC&TCTGTTGTT^ 



i * 2^3 

TO^CTGGG&CCCTTCTGGACGI^^ 



-4~< 




4__„_.-L-4 180 



agtggaccctgggaagacctgcaacgggacactiqJgg^ 



ACGAACCCC^TCATTGACCCCAG^^ 

.^ + „„— « 4—, 4 4 — — -4. 240 

1XCTTGGGGCCCATAACTGGO 



3^4 

(^TTCCAAGGTCCATTCATTGCAAAGfn^ 

2 « 4 4 ^. + ™„.-_- + -- 300 

CTAAGGTTCCACGTAAGTAAC GTTT^AGGAATTTATAAAAGAGTGACGAAGGATGATTT 



GGACGGACAGAGCATTTC7TCTTCAGCCACATACTTTCCTTC 

„ ^ + _ 4-—-. 4— -*4 - ♦ 360 

CCTlXCTOTCTCGTAAACAAG^ 



4 5 

TCTATTAGACTAGAACTGTGGATAAACCTCAG^AAATGGCCA^ 



4 420 

AGATAATCTGATCTTCtf^ 



HATQQRA S D* 

ACGAGAGCSATCTCCCAGITTGATCAC^ 

421 — 4--..—+ — " — ♦ 480 

TGCTCTCCTAGAGGGTCAAACTAGTGTTAAACGACGGTCTC 



TAGATGC^GTTCAGTTGGCAAAGGJACTAGAAGAAGAGGAGC^ 

4 4—- — ^4— - T —4 — - — 4 540 

ATCTACGTCAAGTCAACCGTTTCCTT^ 

DAV QLAKE LEE E E Q X E R A- X K ~ 
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TGCAGAAAGGCTACAACTCTCAAATGCGCAGTGAAGCAAAAAGGOT 

+ + + — +-- + 600 

ACGTCrTTCCGATGTTGAGXGTrrACGCGTCACTTC 



541 



601 



QKGYNSQM R S E A K R t K T P V T - 

OTATGAGCCGTACAGCTCATC^TACCACAG^ ^ 
GAATAC TCGGCATCTCGW3TACCTATGGTGTCCTCTAC CGCCGGC0JACCCAAAATGXM3T 



Y E P Y S S W I P Q E M A A A G F^ Y F T - 

CTGGGGTAmTCTGGGATTCAGTGCTTCTC^ 

661 4 —-4 ♦ 720 

GACCCCATTTTAGACCCTAAGTCAC^ 

GVKS01QCFC C 8 L I It F 0 A G L - 

TCACGAGACTCCCCATAGAAGACCACAA£^GGTTTCATC 

7n 4— 4 +~ ~* 780 

AGTGCTCTGAGGGGTATCTTCTGGTG^ 

t R t» F I E D H K R F H P D C G F L L H 

ACAAGGATCTTGGTAACATOXrCAACT 

78i 4-— wo 

TG1TCCTACAACCATTGTAACGGTTCATGCTGTATTCCCACTTC 

KDVGHXAKYDI RV K H LK S R L 
TCAGAGGAGGTAAAATGAGGTACOAGAAGAGGAGGCTA 
ACTCTCCTCCATTTTACTCCATGGITCTTC 



841 * ^ * — *> 00 




R G G K M R Y Q E BEAR LAS F R N W - 
EcoRI 



GGCCATmATGTCCAAGGGATM&CCTTC 

. - :f— + — — 4 

CCGGTAAAATACAGGTTCC PTATAGp GGAACACAC GAGAGTCTCCGACCGAAACAGAAAT 




P FYVQGI SPCVLSEAGFVF1 1 - 
5 6 

^AAAttG(3£ACGGTACAGTGT^^ 

+ 4 + — 4 —4 1020 

G K Q 0 T V Q C F S C G G C L G H W E E - 

AAGGAGATCATCCTTGGAAGGAACATGCaU^TGG 
1 + + — -j—— *4 1080 

' TTCCTCTACTAGGAACCTTCCTT^ 



961 



G DDPWKEHAKWF PK CEFLRS 

GTAAGAAATCCTCAGAGGAAATTACCCACTATATTCAA^ 

CATTCTTTAGGAGTCTCCTTTAATGGG TCATATAAGTTTCGATGTTCCCTAAACAACTGT 

Fig. 7B 
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K K S S E E I T Q Y X Q S Y K G F V D I 

7 8 EcoRI ® ^ 

TAACGjGGAGAACATTTTGTpAA?^^ 



:GjGGAGAACATT^^ 

sqCCTC TTGTAAAACA ^TTAAGp XCCCAGGTC 



-4 1200 



ATTGqCCTCTTGTAAAACA ^^ 

? G E H F VK S W V Q R E LP MAS A T " 

ATTGCAATGACAGCATCTTrGCOT 

1201 4 —4 4 4 * 4 1260 

TAACGTTACTGTCGTAGAAACGAATGC TTCTTCATGCCQACCTGAGAAAATTCCTGACCG 

CUDS XFAYEELR LDS F K D W P - 

9 10 

CCC GGGAATCAGCTCTGGGAGTTGCAGCACTGGCCAAAGCAGGTC 

1261 — 4 ♦ -4 + 4 1320 

GGGC CCTTAGTCGACACCCTCAACGTC GTGACC GG^lVlCGTCCAGAAAAGATGTGTCpiT 

RES A V G V A A L A K A G L F Y T G I - 

TAAAGGACATCGTCCAGTCX71TTTCCTGTGGAGG 

1321 4- 4 + + + 4 1380 

ATTTCCTGTAGCAGGTCACGAAAA^ 

KDIVQCF SCGGC LEKWQ E G D - 

10 11 

ATGACC CATTAGACGATCACACCAGATGTTTTCCCA^ 

138 1 +~r— 4—- 4 -k-4™ ♦ 4 1440 

TACTGGGTAATCITCTACTGTOST^ 



DPLDDBTR-CF P N C PF L Q K M K - 

11 12 

AGTCCTCTGCGGAAGTGACTCCAGACCTTCAGAGCC 

1441 4 4 4— 4— 4- 4 1500 

TCAGGJU^GCCITOUC^^ 

SSA EVTPD LQS RGEX.C & LLE- 

12 . .13 

AAACCACAAGTGAAAGCAATCTTGAAGATO 

1501 4- 4— 4— 4 — + h-4 1560 

TTTGCT G WCACTTTCGTrAGAACTTCTAAgTt 

T T S E S M L ED S I AV GP IV P E M - 

TGGCACAGGGTGAAGCCCAGTGGTTTCAA(^GGCAAAGAATC 

1561 4 4 4 — 4— 4- 4 1620 